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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1912 
CCATAACCAT GAGTGATAAC AC 22 

2) INFORMATION FOR SEQ ID NO: 1913 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1913 
CGCCTTGATC ATTGGGAACC 20 

25 

2) INFORMATION FOR SEQ ID NO: 1914 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1914 
40 CGCCTTGATC GTTGGGAACC 20 



45 



2) INFORMATION FOR SEQ ID NO: 1915 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1915 

CGCCTTGATA GTTGGGAACC 20 

5 

2) INFORMATION FOR SEQ ID NO: 1916 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear' 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1916 
CGTGGGTCTT GCGGTATCAT 20 

20 

2) INFORMATION FOR SEQ ID NO: 1917 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1917 
35 CGTGGGTCTG GCGGTATCAT 20 



2) INFORMATION FOR SEQ ID NO: 1918 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1918 

50 

GTGGGTCTCA CGGTATCATT G 21 
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2) INFORMATION FOR SEQ ID NO: 1919 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1919 
CGTGGGTCTC TCGGTATCAT T 



2) INFORMATION FOR SEQ ID NO: 1920 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1920 
CGTGGNTCTC GCGGTATCAT 



2) INFORMATION FOR SEQ ID NO: 1921 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1921 
CGTGGGTCTA GCGGTATCAT T 



2) INFORMATION FOR SEQ ID NO: 1922 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1922 
GTTTTCCAAT GATTAGCACT TTTA 24 



2) INFORMATION FOR SEQ ID NO: 1923 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1923 
GTTTTCCAAT GATAAGCACT TTTA 24 

25 

2) INFORMATION FOR SEQ ID NO: 1924 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1924 
40 GTTTTCCAAT GCTGAGCACT TTT 23 



2) INFORMATION FOR SEQ ID NO: 1925 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1925 
CGTTTTCCAA TGATGAGCAC TTT 
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2) INFORMATION FOR SEQ ID NO: 1926 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single ■ 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1926 
GTTTTCCAAT GGTGAGCACT TTT 



2) INFORMATION FOR SEQ ID NO: 1927 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: MC9690-129 

(C) ACCESSION NUMBER: AF126482 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1927 

ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 
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GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 
GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 
TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 
AGCATTGGTA A 



750 
800 
850 
861 
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2) INFORMATION FOR SEQ ID NO: 1928 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE:, Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: HB251 

(C) ACCESSION NUMBER: X57972 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1928 

ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTAAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 4 00 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCATTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



2) INFORMATION FOR SEQ ID NO: 1929 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 808 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 
5 (B) STRAIN: 26W 

(C) ACCESSION NUMBER: U09188 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1929 



10 ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAGCT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC, GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

15 CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACCCGCCTT GATAGTTGGG AACCGGAGCT 500 

20 GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA' 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCGGTAA GCGTGGATCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

25 GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATG 808 



30 2) INFORMATION FOR SEQ ID NO: 1930 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: AF190692 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1930 

45 

ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

50 GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 250 

■ CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 400 
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ACTTACTTCT 
CACAACATGG 
GAATGAAGCC 
TGGCAACAAC 
TCCCGGCAAC 
ACTTCTGCGC 
GAGCCGGTGA 
GGTAAGCCCT 
TATGGATGAA 
AGCATTGGTA 



GACAACGATC 
GGGATCATGT 
ATACCAAACG 
GTTGCGCAAA 
AATTAATAGA 
TCGGCCCTTC 
GCGTGGGTCT 
CCCGTATCGT 
CGAAATAGAC 
A 



GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 
AACTCGCCTT GATCGTTGGG AACCGGAGCT 
ACGAGCGTGA CACCACGATG CCTGCAGCAA 
CTATTAACTG GCGAACTACT TACTCTAGCT 
CTGGATGGAG GCGGATAAAG TTGCAGGACC 
CGGCTGGCTG GTTTATTGCT GATAAATCTG 
GGCGGTATCA TTGCAGCACT GGGGCCAGAT 
AGTTATCTAC ACGACGGGGA GTCAGGCAAC 
AGATCGCTGA GATAGGTGCC TCACTGATTA 



450 
500 
550 
600 
650 
700 
750 
800 
850 
861 
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2) INFORMATION FOR SEQ ID NO: 1931 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: BM2728 

(C) ACCESSION NUMBER: AF104442 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1931 

ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCGGTGA GCGTGGGTCT CTCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



2) INFORMATION FOR SEQ ID NO: 1932 



1008 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D)' TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
L0 (A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: AF190695 

(xi) SEQUENCE DESCRIPTION: SEQ,ID NO: 1932 

15 ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGCT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

20 CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 4 00 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

25 GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCGGTGA GCGTGGGTCT AGCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

3 0 GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1933 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 861 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE :. Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: AF190693 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1933 

ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 
TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 
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CTGAAGATCA GTTGGGTGCA CGAGTGGGTT 
AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC 
GAGCACTTTT AAAGTTCTGC TATGTGGTGC 
CCGGGCAAGA GCAACTCGGT CGCCGCATAC 
5 GTTGAGTACT CACCAGTCAC AGAAAAGCAT 
AAGAGAATTA TGCAGTGCTG CCATAACCAT 
ACTTACTTCT GACAACGATC GGAGGACCGA 
CACAACATGG GGGATCATGT AACCCGCCTT 
GAATGAAGCC ATACCAAACG ACGAGCGTGA 

10 TGGCAACAAC GTTGCGCAAA CTATTAACTG 
TCCCGGCAAC AATTAATAGA CTGGATGGAG 
ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG 
GAGCCGGTGA GCGTGGATCT CGCGGTATCA 
GGTAAGCCCT CCCGTATCGT AGTTATCTAC 

15 TATGGATGAA CGAGATAGAC AGATCGCTGA 
AGCATTGGTA A 



PCT/CA00/01150 

ACATCGAGCT GGATCTCAAC 150 

GAAGAACGTT TTCCAATGGT 200 

GGTATTATCC CGTGTTGACG 250 

ACTATTCTCA GAATGACTTG 300 

CTTACGGATG GCATGACAGT 350 

GAGTGATAAC ACTGCGGCCA 400 

AGGAGCTAAC CGCTTTTTTG 4 50 

GATCGTCGGG AACCGGAGCT 500 

CACCACGATG CCTGCAGCAA 550 

GCGAACTACT TACTCTAGCT 600 

GCGGATAAAG TTGCAGGACC 650 

GTTTATTGCT GATAAATCTG 700 

TTGCAGCACT GGGGCCAGAT 750 

ACGACGGGGA GTCAGGCAAC 800 

GATAGGTGCC TCACTGATTA 850 

861 



20 2) INFORMATION FOR SEQ ID NO: 1934 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1934 

GAACGCCAGC GCGAAATTCA AAAAG 25 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1935 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1935 
AGCTCGGCAT ACTTCGACAG G 21 

2) INFORMATION FOR SEQ ID NO: 1936 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY-: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1936 
TACCACCCGC ACGGC 



2) INFORMATION FOR SEQ ID NO: 1937 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1937 
CGGAGTCGCC GTCGATG 



2) INFORMATION FOR SEQ ID NO: 1938 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1938 
CCGCGCACCA TTGCTTCGTA CACTGAGGAG TCTCCGCGCG G 



2) INFORMATION FOR SEQ ID NO: 1939 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1939 

5- 

CGACCCGGAT GGTAGTATCG ATAATGATCC GCCAGCGGCC GGGTCG 4 6 



LO 2) INFORMATION FOR SEQ ID NO: 1940 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 
L5 (C) STRANDEDNESS: Single 

(D) TOPOLOGY:- Linear 

(ii) MOLECULE TYPE: DNA 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1940 

GTATCGTTGG TGACGTAAT 19 



25. 



35 



20 



2) INFORMATION FOR SEQ ID NO: 1941 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1941 
GCAATGGTCC GTTTAAGT 18 

2) INFORMATION FOR SEQ ID NO: 1942 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1942 
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GACTGGAACA AAGCCTATAA AAAATCA 



2) INFORMATION FOR SEQ ID NO: 1943 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1943 
GCTGGTGGAC GGCCAG 



2) INFORMATION FOR SEQ ID NO: 1944 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1944 
TTTCGCCGCC ATGCGTTAC 



2) INFORMATION FOR SEQ ID NO: 1945 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1945 
CGGCGACTAC GCGGTAT 



2) INFORMATION FOR SEQ ID NO: 1946 
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(i)' SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1946 
CGGCGACTTC GCGGTAT 



2) INFORMATION FOR SEQ' ID NO: 1947 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1947 
CGGTATACGG CACCATCGT 



2) INFORMATION FOR SEQ ID NO: 1948 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1948 
GCGGTATACA ACACCATCG 



2) INFORMATION FOR SEQ ID NO: 1949 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1949 
CGGTATACGC CACCATCGT 



2) INFORMATION FOR SEQ ID NO: 1950 

(i) SEQUENCE CHARACTERISTICS:, 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1950 
GGCGACATCG CCTGC 



2) INFORMATION FOR SEQ ID NO: 1951 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1951 
GGCGACAGAG CCTGCTA 



2) INFORMATION FOR SEQ ID NO: 1952 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1952 
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CCTGCTATGG AGCGATGGT 19 



2) INFORMATION FOR SEQ ID NO: 1953 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 21 bases 
L0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



L5 



10 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1953 
CGCCTGCTAT AAAGCGATGG T 21 

2) INFORMATION FOR SEQ ID NO: 1954 



' (i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 589 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

J5 (C) ACCESSION NUMBER: AF052258 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1954 

ACACCGGTCA ACATTGAGGA AGAGCTTAAG AACTCTTATC TGGATTATGC 50 

10 GATGTCGGTC ATTGTTGGCC GTGCGCTGCC GGATGTCCGA GATGGCCTGA 100 

AGCCGGTACA CCGTCGCGTA CTTTACGCCA TGAACGTATT GGGCAATGAC 150 

TGGAACAAAG CCTATAAAAA ATCAGCCCGT GTCGTTGGTG ACGTAATCGG 200 

TAAATACCAC CCGCACGGCG ACTCCGCGGT ATACGACACC ATCGTGCGTA 250 

TGGCGCAGCC GTTCTCGCTG CGTTACATGC TGGTGGACGG CCAGGGTAAC 300 

15 TTTGGTTCCA TCGACGGCGA CTCCGCCGCG GCGATGCGTT ATACCGAAAT 350 

TCGTCTGGCG AAAATCGCTC ATGAGCTGAT GGCCGATCTT GAAAAAGAGA 400 

CGGTCGATTT CGTCGACAAC TATGACGGTA CGGAGCGTAT TCCGGACGTC 450 

ATGCCGACCA AAATTCCTAA CCTGCTGGTG AACGGCGCCT CCGGGATCGC 500 

CGTAGGGATG GCCACCAACA TACCGCCACA TAACCTGACG GAAGTGATTA 550 

50 ACGGCTGTCT GGCGTATGTT GACGATGAAG ACATCAGCA 589 
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2) INFORMATION FOR SEQ ID NO: 1955 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1955 
CGCGCAACCA TTGCTTCGTA CACTGAGGAG , TCTGCGCG 



2) INFORMATION FOR SEQ ID NO: 1956 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 989 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Candida inconspicua 

(B) STRAIN: ATCC 16783 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1956 

CGGTAAAACC ACTTTAACTG CTGCAATCAC CAAAGTTTTA TCTGAAGAAG 50 

GTGGTGCAGA TTTCTTAGAT TATTCATCTA TTGATAAAGC ACCAGAAGAG 100 

AGAGCTAGAG GTATTACCAT TTCTACTGCT CATGTTGAAT ATGAAACTCC 150 

AAACAGACAT TATTCACATG TTGATTGTCC AGGTCATCAA GATTATATTA 200 

AGAACATGAT TACTGGTGCA GCTCAAATGG ATGGTGCTAT TATTGTTGTT 250 

GCAGCTACTG ATGGTCAAAT GCCACAAACT AAGGAACATT TATTATTAGC 300 

TAGACAAGTT GGTGTTCAAC ATTTAGTTGT TTTTGTTAAC AAGTGTGATA 350 

CCGTTGATGA TCCAGAAATG TTRGAATTAG TTGAAATGGA AATGAGAGAA 4 00 

TTATTGACCG AATATGGATT YGATGGTGAT AACACACCAG TTATTATGGG 450 

TTCTGCATTA ATGGCATTAG AAGGTAAGAG ACCAGAAGTT GGTAAAGAAT 500 

CTATTGTTAA GTTAATGGAA GCTGTTGATA CTTGGATTCC AACTCCACAA 550 

AGAGACTTAG AAAAACCATT CTTATTACCA ATTGATGAAG TTTTCTCTAT 600 

TTCTGGTAGA GGTACTGTTG TTTCTGGTAC TGTTGATAGA GGTACATTAA 650 

AGAAGGGTGA AGAAGTTGAA ATTGTTGGTG GTAAAGAAGG TGTTATTAAG 700 

ACTACTGTTA CCGGTATTGA AATGTATCAC AAGGAATTAG ATCAAGCACA 750 

AGCTGGTGAT ACTCCAGGTA TTTTGTTAAG AGGTGTTAAG AGAGATCAAA 800 

TTGCAAGAGG TCAAATTCTT GCAAAGCCAG GWTCTGTTAA GGCATACAAG 850 

AAGTTCTTAT CATCATTATA CATTTTAACA AAGGAAGAAG GTGGTAGACA 900 

TACTCCATTT TCTGAAAATT ACAGACCTCA AATGTACATT AGAACTTCCA 950 

ATGTTAATGT TACTTTGAAG TTCCCAGAAA CTGAAGAAG 989 



1017 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1957 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida vtilis 

(B) STRAIN: ATCC 22023 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1957 

GGTAAGACCA CCCTTACTGC CGCCATCACC AAGTGCCTTG CTGAGAAGGG 50 

AGGTGCCTCG TTCTTGGACT ACAGTGCCAT CGACAAGGCG CCAGAGGAGA 100 

GAGCAAGAGG TATCACCATC TCCACTGCGC ACGTTGAGTA TGAAACTGCC 150 

2 0 AACAGACACT ACTCGCACGT TGACTGTCCA GGTCACGCTG ATTACATCAA 200 

GAACATGATT ACCGGTGCTG CGCAGATGGA CGGTGCTATC ATTGTCGTTG 250 

CAGCCACTGA CGGTCAGATG CCACAGACCA GAGAACACTT GTTGCTTGCC 300 

AGACAAGTTG GTGTCCAGCA CATTGTTGTC TTCGTCAACA AGGTTGACAC 350 

CATCGACGAC CCTGAGATGC TTGAGCTTGT TGAAATGGAG ATGAGAGAGT 400 

25 TGCTTACTTC GTATGGATTT GACGGTGATA ACACCCCAGT TATCATGGGT 450 

TCTGCTTTGT GTGCTTTGGA AGGCCGTGAG CCAGAGATTG GTGCTAAGGC 500 

CATTGACAAG TTGATGGAGG CCATTGATGA GTACATCCCA ACTCCTCAGA 550 

GAGACCTGGA AAAGCCATTC YTGATGGGTG TTGAAGACGT GTTCTCGATC 600 

TCTGGTAGAG GTACCGTTGT CACRGGCCGT GTTGAGCGTG GTAACTTGAA 650 

30 GAAAGGTGAT GAAATTGAAC TTGTTGGCTA CAACAAGAAC CCAATCAAGA 700 

CCACCGTCAC CGGTATCGAA ATGTTCAAGA AGGAGTTGGA ATCTGCCATG 750 

GCTGGTGACA ACTGTGGTAT CTTGTTGCGT GGTATCAAGA GAGATGACGT 800 

CAAGAGAGGT ATGGTTGCTG CTAAGCCAGG CTCCGTCTCT GCACACACCA 850 

AGTTCCTCGC TTCCTTGTAC ATCCTGACRA AGGAGGAAGG TGGTCGTCAC 900 

35 AGTGCCTTTG CTGAGAACTA CAGACCACAG ATGTTCATCA GAACCGGAGA 950 

TGTCACCACC ATCTTGACAT GGCCAGAGGA GCACGCTGAC C 991 



40 2) INFORMATION FOR SEQ ID NO: 1958 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 

(B) STRAIN: ATCC 7351 
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CGGTAAGACC ACTTTGACCG CCGCCATCAC CAAGGTGTTG AGCGCCAAAG 50 

GTGGTGCTTC CTTCTTGGAC TACGGGTCCA TCGACAGAGC CCCTGAGGAG 100 

5 AGAGCCAGAG GTATTACTAT CTCGACTGCC CACGTTGAGT ACGAGACCGA 150 

TAAGAGACAC TACGCCCACG TTGATTGCCC TGGTCACGCT GATTACATCA 200 

AGAACATGAT CACTGGTGCC GCCCAAATGG ACGGTGCCAT TATTGTCGTT 250 

GCTGCTTCTG ATGGCCAAAT GCCGCAGACC AGAGAGCACT TGTTGCTTGC 300 

CAGACAGGTT GGTGTGCAGA ACTTGGTTGT GTTTGTTAAC AAGGTGGACA 350 

L0 CCATCGACGA CCCCGAAATG TTGGAGTTGG TGGAGATGGA AATGAGAGAA 4 00 

TTGTTGACCC ACTACGGCTT TGACGGTGAC AACACCCCTG TCATCATGGG 4 50 

TTCGGCGTTG TGTGCCTTGG AAGACAGGCA GCCTGAGATT GGCGAGCAAG 500 

CCATCATGAA GTTGTTGGAC GCTGTCGACG AGTACATTCC CACTCCTCAG 550 

AGAGACTTGG AGCAACCATT TTTGATGCCC GTTGAGGATG TTTTCTCCAT 600 

L5 CTCTGGCAGA GGTACTGTTG TCACCGGTCG TGTTGAGAGA GGCTCATTGA 650 

AGAAGGGTGA GGAGATTGAG ATTGTTGGCG ACTTCCCCAA GCCCTTCAAG 700 

ACTACCGTCA CCGGCATTGA GATGTTCAAG AAGGAGTTGG ATGCCGCGAT 750 

GGCGGGCGAC AACGCCGGGA TCTTGTTGAG AGGTGTCAAG AGAGACGAGG 800 

TCTCGAGAGG TATGGTTTTG GCCAAGCCCG GTACTGTCAC TTCGCACACC 850 

20 AAGGTGTTGG CGTCGCTTTA CATCTTGACC AAAGAGGAAG GTGGCCGCCA 900 

CTCGCCCTTT GGTGAGAACT ACAAGCCACA GTTATTCATC AGAACCTCCG 950 

ATGTCACTGG TACTTTGAGG TTCCCCGCCG GTGAG 985 



25 



35 



10 



2) INFORMATION FOR SEQ ID NO: 1959 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 973 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata 

(B) STRAIN: ATCC 10565 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1959 



CGGTAAGACC ACCTTGACTG CCGCCATCAC CAAGGTTCTC TCCGAGAAGG 50 

GTGGTGCCGA CTTCTTGGAC TACGGTGCCA TTGACAGAGC CCCCGAGGAG 100 

15 CGTGCCCGTG GTATCACCAT CTCCACTGCC CACGTTGAGT ACGAGACTGA 150 

CAACCGTCAC TACGCCCACA TTGACTGTCC CGGTCACGCT GATTACATCA 200 

AGAACATGAT TACCGGTGCC GCCCAGATGG ACGGTGCCAT TATTGTCCTT 250 

GCTGCTACTG ACGGTGCCAT GCCCCAGACC CGCGAGCACT TGCTTCTCGC 300 

CCGTCAGGTT GGTATCCAGG AATTGGTTGT GTTTGTGAAC AAGGTTGACA 350 

50 CCATCGACGA CCCCGAGATG TTGGAGCTCG TTGAGATGGA GATCCGCGAG 4 00 

TTGTTGTCTG AGTTCGGTTT TGACGGTGAC AACACCCCCG TCATCATGGG 4 50 

TTCCGCTTTG TGCGCTTTGG AGGGCAAGCA GCCCGAGATT GGTGAGCAGG 500 

CTATCACCAA GTTGATGGCC GCCGTTGACG AGCACATCCC CACCCCCCAG 550 
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CGTGACTTGG AGCAGCCTTT 
CTCTGGCCGT GGTACCGTGG 
AGAAGGGTGA GGAGATTGAG 
GTGACTGTTA CTGGTATTGA 
GGCTGGTGAC AACGCCGGTA 
TGTCTCGTGG TATGGTTTTG 
AAGGTTTTGG CTTCGCTTTA 
GACCGGCTTC GGCTCCAACT 
ACGTCACTGG TACCCTCACC 



CTTGATGCCT GTTGAGGGTG TTTTCTCTAT 
TGACTGGTAA GGTTGCCCGT GGTGTCCTCA 
ATTGTTGGCA ACTTTGACAA GCCCTACAAG 
GATGTTCAAG AAGGAGTTGG ACCAGGCCAT 
TCTTGTTGCG TGGTGTCAAG CGTGACGAGG 
GCCAAGCCCG GCACTGTTGT CTCGCACAAG 
CATCTTGACC CAGGAGGAGG GTGGCCGTAA 
ACAAGCCCCA GTTGTTCTTG CGCACTACCG 
TTC 



600 
650 
700 
750 
800 
850 
900 
950 
973 



LO 



L5 



10 



15 



10 



15 



[0 



[5 



2) INFORMATION FOR SEQ ID NO: I960 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Candida krusei 

(B) STRAIN: ATCC 28870 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1960 

AAGACTACCT TGACTGCTGC AATCACCAAG GTCTTAGCTG ATCAAGGTGG 50 

TGCTGATTTC TTAGATTATG CATCTATTGA CAAGGCTCCT GAAGAAAGAG 100 

CAAGAGGTAT TACTATCTCT ACTGCTCACG TTGAGTATGA AACCCCAAAC 150 

AGACATTATT CTCATGTCGA TTGTCCTGGC CATCAAGATT ATATTAAGAA 200 

TATGATTACT GGTGCTGCAC AAATGGATGG TGCTATTATT GTTGTTGCTG 250 

CTACTGATGG TCAAATGCCA CAAACTAAGG AACATTTATT ATTAGCAAGA 300 

CAAGTTGGTG TTCAACATTT AGTTGTCTTT GTTAATAAAT GTGACACCAT 350 

TGATGACCCA GAAATGTTGG AATTAGTTGA AATGGAAATG AGAGAACTAT 4 00 

TGTCTGAATA TGGTTTTGAT GGTGATAACA CTCCAGTTAT TATGGGTTCT 450 

GCATTGATGG CTTTAGAAGA CAAGAGACCT GAAGTTGGTA AGGAATCTAT 500 

TTTAAAGTTA ATGGAAGCYG TTGACACATG GATTCCAACC CCAGAGAGAG 550 

ATTTAGAAAA ACCATTTTTG TTACCTATTG ATGAAGTTTT CTCAATCTCT 600 

GGTAGAGGTA CTGTCGTTTC TGGTACTGTC GAAAGAGGTA CTTTGAAGAA 650 

GGGTGAAGAA GTTGAAATTG TTGGTGGTAA GGATGGTTCT ATTAAAACTA 700 

CTGTCACAGG TATTGAAATG TATCACAAGG AATTAGACCA AGCGCAAGCA 750 

GGTGATACTC CAGGTATTTT ATTAAGAGGT GTCAAGAGAG ACCAAATCAA 800 

GAGAGGTCAA ATTTTAGCAA AGCCAGATTC CGTTAAGGCA TACAAGAAGT 850 

TCTTGGCTTC CCTTTATATC TTAACCAAGG AAGAAGGTGG TAGACATACA 900 

CCATTCTCTG AAAACTACAG ACCACAAATG TACATCAGAA CTACCAATGT 950 

TAACGTTACT TTGAAGTTCC CAGACACTGA AGAAG 985 



2) INFORMATION FOR SEQ ID NO: 1961 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1961 
GCTCAAGGCA GATGGCATTC CC 



2) INFORMATION FOR SEQ ID NO: 1962 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1962 
GGACAAGGCG GTTGCGTTTG AT 



2) INFORMATION FOR SEQ ID NO: 1963 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1963 
CATTCCCGTC TCGCTCGACA GT 



2) INFORMATION FOR SEQ ID NO: 1964 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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5 



25 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1964 
ATCTGCCTGC CCGTCTTGC 19 



LO 2} INFORMATION FOR SEQ ID NO: 1965 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE : Nucleic acid 
L5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plasmid pGS05 

(C) ACCESSION NUMBER: M36657 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1965 



ATGAATAAAT CGCTCATCAT TTTCGGCATC GTCAACATAA CCTCGGACAG 50 

TTTCTCCGAT GGAGGCCGGT ATCTGGCGCC AGACGCAGCC ATTGCGCAGG 100 

CGCGTAAGCT GATGGCCGAG GGGGCAGATG TGATCGACCT CGGTCCGGCA 150 

TCCAGCAATC CCGACGCCGC GCCTGTTTCG TCCGACACAG AAATCGCGCG 200 

30 TATCGCGCCG GTGCTGGACG CGCTCAAGGC AGATGGCATT CCCGTCTCGC 250 

TCGACAGTTA TCAACCCGCG ACGCAAGCCT ATGCCTTGTC GCGTGGTGTG 300 

GCCTATCTCA ATGATATTCG CGGTTTTCCA GACGCTGCGT TCTATCCGCA 350 

ATTGGCGAAA TCATCTGCCA AACTCGTCGT TATGCATTCG GTGCAAGACG 4 00 

GGCAGGCAGA TCGGCGCGAG GCACCCGCTG GCGACATCAT GGATCACATT 4 50 

35 GCGGCGTTCT TTGACGCGCG CATCGCGGCG CTGACGGGTG CCGGTATCAA 500 

ACGCAACCGC CTTGTCCTTG ATCCCGGCAT GGGGTTTTTT CTGGGGGCTG 550 

CTCCCGAAAC CTCGCTCTCG GTGCTGGCGC GGTTCGATGA ATTGCGGCTG 600 

CGCTTCGATT TGCCGGTGCT TCTGTCTGTT TCGCGCAAAT CCTTTCTGCG 650 

CGCGCTCACA GGCCGTGGTC CGGGGGATGT CGGGGCCGCG ACACTCGCTG 700 

10 CAGAGCTTGC CGCCGCCGCA GGTGGAGCTG ACTTCATCCG CACACACGAG 750 

CCGCGCCCCT TGCGCGACGG GCTGGCGGTA TTGGCGGCGC TGAAAGAAAC 800 

CGCAAGAATT CGTTAA 816 



15 

2) INFORMATION FOR SEQ ID NO: 1966 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1966 
5 CATGCCAGTC TTGCCAACG 19 



10 



2) INFORMATION FOR SEQ ID NO: 1967 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1967 

20 

CAGCAATAAG TAATCCAGCG ATG 23 



25 2) INFORMATION FOR SEQ ID NO: 1968 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1968 

GGAGAGATTT CACCGCATAG 20 



10 



50 



2) INFORMATION FOR SEQ ID NO: 1969 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1969 

AGCCAACCAT CATGCTATTC CA 22 
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2) INFORMATION FOR SEQ ID NO: 1970 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Transposon TnlO 
(C) ACCESSION NUMBER: J01830 

L5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1970 

ATGAATAGTT CG AC AAA GAT CGCATTGGTA ATTACGTTAC TCGATGCCAT 50 

GGGGATTGGC CTTATCATGC CAGTCTTGCC AACGTTATTA CGTGAATTTA 100 

TTGCTTCGGA AGATATCGCT AACCACTTTG GCGTATTGCT TGCACTTTAT 150 

2 0 GCGTTAATGC AGGTTATCTT TGCTCCTTGG CTTGGAAAAA TGTCTGACCG 200 
ATTTGGTCGG CGCCCAGTGC TGTTGTTGTC ATTAATAGGC GCATCGCTGG . 250 

ATTACTTATT GCTGGCTTTT TCAAGTGCGC TTTGGATGCT GTATTTAGGC 300 

CGTTTGCTTT CAGGGATCAC AGGAGCTACT GGGGCTGTCG CGGCATCGGT 350 

CATTGCCGAT ACCACCTCAG CTTCTCAACG CGTGAAGTGG TTCGGTTGGT 400 

25 TAGGGGCAAG TTTTGGGCTT GGTTTAATAG CGGGGCCTAT TATTGGTGGT 4 50 

TTTGCAGGAG AGATTTCACC GCATAGTCCC TTTTTTATCG CTGCGTTGCT .500 

AAATATTGTC ACTTTCCTTG TGGTTATGTT TTGGTTCCGT GAAACCAAAA 550 

ATACACGTGA TAATACAGAT ACCGAAGTAG GGGTTGAGAC GCAATCGAAT 600 

TCGGTATACA TCACTTTATT TAAAACGATG CCCATTTTGT TGATTATTTA 650 

JO TTTTTCAGCG CAATTGATAG GCCAAATTCC CGCAACGGTG TGGGTGCTAT 700 

TTACCGAAAA TCGTTTTGGA TGGAATAGCA TGATGGTTGG CTTTTCATTA 750 

GCGGGTCTTG GTCTTTTACA CTCAGTATTC CAAGCCTTTG TGGCAGGAAG 800 

AATAGCCACT AAATGGGGCG AAAAAACGGC AGTACTGCTC GAATTTATTG 850 

CAGATAGTAG TGCATTTGCC TTTTTAGCGT TTATATCTGA AGGTTGGTTA 900 

15 GATTTCCCTG TTTTAATTTT ATTGGCTGGT GGTGGGATCG CTTTACCTGC 950 

ATTACAGGGA GTGATGTCTA TCCAAACAAA GAGTCATGAG CAAGGTGCTT 1000 

TACAGGGATT ATTGGTGAGC CTTACCAATG CAACCGGTGT TATTGGCCCA 1050 

TTACTGTTTA CTGTTATTTA TAATCATTCA CTACCAATTT GGGATGGCTG 1100 

GATTTGGATT ATTGGTTTAG CGTTTTACTG TAT TAT TAT C CTGCTATCGA 1150 

tO TGACCTTCAT GTTAACCCCT CAAGCTCAGG GGAGTAAACA GGAGACAAGT 1200 

GCTTAG 1206 



15 2) INFORMATION FOR SEQ ID NO: 1971 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

;0 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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N. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1971 
CYGACTGYGC CATCCTYATC A 21 

5 

2) INFORMATION FOR SEQ ID NO: 1972 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear' 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1972 
MGICAGCTCA TYITTGCWKS C 21 

20 

2) INFORMATION FOR SEQ ID NO: 1973 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1973 
35 RACACCRGIY TTGGWITCCT T 21 



to 



2) INFORMATION FOR SEQ ID NO: 1974 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1974 

50 

ACAAGGGITG GRMSAAGGAG AC 22 
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2) INFORMATION FOR SEQ ID NO: 1975 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1975 
TGRCCRGGGT GGTTRAGGAC G 



2) INFORMATION FOR SEQ ID NO: 1976 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1976 
GATGGAYTCY GTYAAITGGG A 



2) INFORMATION FOR SEQ ID NO: 1977 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1977 
GATGGAYTCY GTYAARTGGG A 



2) INFORMATION FOR SEQ ID NO: 1978 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1978 
CATCITGYAA TGGYAATCTY AAT 23 



2) INFORMATION FOR SEQ ID NO: 1979 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1979 
CATCYTGYAA TGGYAASCTY AAT 23 

2) INFORMATION FOR SEQ ID NO: 1980 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1980 
40 TCRATGGCIT CIAIRAGRGT YT 22 



2) INFORMATION FOR SEQ ID NO: 1981 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1981 

TGGACACCIS CAAGIGGKCY G 



2) INFORMATION FOR SEQ ID NO: 1982 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1982 
TGGACACYIS CAAGIGGKCY G 



2) INFORMATION FOR SEQ ID NO: 1983 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1983 
CYGAYTGCGC YATICTCATC A 



2) INFORMATION FOR SEQ ID NO: 1984 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1984 
CYGAYTGYGC YATYCTSATC A 
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L5 



2) INFORMATION FOR SEQ ID NO: 1985 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi)- ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: Ml-106 

(C) ACCESSION NUMBER: U81804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1985 



ATGGGTAAGG ACAAGCTGCA CGTCAACGTC GTTGTTATCG GTCACGTCGA 50 

CTCCGGTAAG TCGACCACCA CCGGTCACTT GATCTACAAG TGCGGTGGTA 100 

2 0 TCGACAAGCG AACCATTGAG AAGTTCGAGA AGGAGGCTCA AGAGCTCGGA 150 

AAGTCTTCTT TCAAGTACGC TTGGGTTCTT GACAAGCTTA AGGCCGAGCG 200 

AGAGCGAGGT ATCACCATCG ACATTGCTCT TTGGAAGTTC GAGACCCCTA 250 

AGTACCAGGT TACCGTCATT GACGCCCCCG GTCACCGAGA CTTCATCAAG 300 

AACATGATCA CCGGTACCTC CCAGGCTGAC TGTGCCATCC TCATCATTGC 350 

25 CACCGGTATC GGTGAGTTCG AGGCTGGTAT CTCCAAGGAC GGTCAGACCC 4 00 

GAGAGCACGC CCTCCTCGCC TTCACCCTCG GTGTCAGGCA GCTCATTGTT 4 50 

GCTTGCAACA AGATGGACAC CTGCAAGTGG TCTGAGGACC GATTCAACGA 500 

AATCGTCAAG GAGACCAACG GTTTCATCAA GAAGGTTGGT TACAACCCCA 550 

AGGCTGTCCC CTTCGTCCCC ATCTCTGGTT GGCACGGTGA CAACATGTTG 600 

30 GAGGAGACCA CCAACATGCC CTGGTACAAG GGATGGACCA AGGAGACCAA 650 

GTCCGGTGTT TCCAAGGGTA AGACCCTTCT CGAGGCCATC GACGCCAGTA 700 

GGCCCCCTAC CCGACCCACC GACAAGCCCC TCCGTCTCCC TCTCCAGGAC 750 

GTCTACAAGA TCGGTGGTAT CGGCACAGTC CCTGTCGGCC GAGTCGAGAC 800 

CGGTGTCATC AAGGCCGGTA TGGTCGTCAA GTTCGCCCCC ACCAACGTCA 850 

35 CCACTGAAGT CAAGTCCGTT GAGATGCACC ACGAGCAGAT CCCCGAGGGT 900 

CTCCCCGGAG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCATCAAGGA 950 

CATCCGACGA GG TAACGTCT GTGGTGACTC CAAGAACGAC CCCCCTATGG 1000 

AGGCTGCTTC TTTCAACGCC CAGGTTATCG TCCTTAACCA CCCTGGTCAG 1050 

ATCGGTGCCG GTTACACCCC CGTTCTCGAC TGTCACACTG CCCACATTGC 1100 

10 TTGCAAGTTC TCTGAGTTGA TCGAGAAGAT TGACCGACGA ACCGGTAAGG 1150 

TCATGGAGGC CGCCCCCAAG TTCGTCAAGT CTGGTGACGC CGCCATTGTC 1200 

AAGCTTGTTT CCCAGAAGCC TCTCTGTGTT GAGACCTACG CCGACTACCC 1250 

CCCTCTTGGT CGATTCGCCG TCCGAGACAT GCGACAGACC GTTGCCGTTG 1300 

GTGTTATTAA GAGTGTGGAG AAGTCCGATG GGAAGAGCGG CAAGGTTACC 1350 

15 AAGGCCGCCG AGAAGGCTGC TAAGAAGAAG TAA 1383 



2) INFORMATION FOR SEQ ID NO: 1986 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1380 bases 

(B) TYPE: Nucleic acid 
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L0 



(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: B3501 

(C) ACCESSION NUMBER: U81803 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1986 



ATGGGTAAGG ACAAGCTGCA CGTCAACGTC t GTTGTTATCG GTCACGTCGA 50 

CTCCGGTAAG TCGACCACCA CCGGTCACTT GATCTACAAG TGCGGTGGTA 100 

L5 TCGACAAGCG AACCATTGAG AAGTTCGAGA AGGAGGCTCA AGAGCTCGGA 150 

AAGTCTTCTT TCAAGTACGC TTGGGTTCTT GACAAGCTTA AGGCCGAGCG 200 

AGAGCGAGGT ATCACCATCG ACATTGCTCT TTGGAAGTTC GAGACCCCCA .250 

GGTACCAGGT CACCGTCATT GACGCCCCCG GTCACCGAGA CTTCATCAAG 300 

AACATGATCA CCGGTACCTC CCAGGCTGAC TGTGCCATCC TCATCATTGC 350 

2 0 CACCGGTATC GGTGAGTTCG AGGCCGGTAT CTCCAAGGAC GGTCAGACCC 4 00 

GAGAGCACGC CCTCCTCGCC TTCACCCTCG GTGTCAGGCA GCTCATTGTT 450 

GCTTGCAACA AGATGGACAC CTGCAAGTGG TCCGAGGACC GATTCAACGA 500 

AATCGTCAAG GAGACCAACG GTTTCATCAA GAAGGTTGGC TACAACCCCA 550 

AGGCTGTCCC CTTCGTCCCC ATCTCTGGTT GGCACGGTGA CAACATGTTG 600 

25 GAGGAGACCA CCAACATGCC CTGGTACAAG GGATGGACCA AGGAGACCAA 650 

GTCTGGTGTT TCCAGGGGTA AGACCCTTCT CGAGGCCATC AGCGCCAGTA 700 

GGCCCCATAC CCGACCCACC GACAAGCCCC TCCGTCTCCC TCTCCAGGAC 750 

GTCTACAAGA TCGGTGGTAT CGGCACAGTC CCTGTCGGCC GAGTCGAGAC 800 

CGGTGTCATC AAGGCCGGTA TGGTCGTCAA GTTCGCCCCC ACCAACGTCA 850 

30 CCACTGAAGT CAAGTCCGTT GAGATGCACC ACGAGCAGAT CCCCGAGGGT 900 

CTTCCCGGAG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCATCAAGGA 950 

CATCCGACGA GGTAACGTCT GTGGTGACTC CAAGAACGAC CCCCCTATGG 1000 

AGGCTGCTTC TTTCAACGCC CAGGTTATCG TCCTTAACCA CCCTGGTCAG 1050 

ATCGGTGCCG GTTACACCCC CGTTCTCGAC TGTCACACTG CCCACATTGC 1100 

35 CTGCAAGTTT GCTGAGTTGA TCGAGAAGAT TGACCGACGA ACCGGTAAGG 1150 

TCATGGAGGC CGCCCCCAAG TTCGTCAAGT CTGGTGACGC CGCCATTGTC 1200 

AAGCTTGTTG CCCAGAAGCC CCTCTGTGTT GAGACCTACG CCGACTACCC 1250 

CCCTCTTGGT CGATTCGCCG TCCGAGACAT GCGACAGACC GTTGCCGTTG 1300 

GTGTTATCAA GAGCGTGGAC AAGACCGAGA AGGGTGGCAA GGTCACCAAG 1350 

3 0 GCTGCTGAGA AGGCTGCCAA GAAGAAGTAA 1380 



15 



2) INFORMATION FOR SEQ ID NO: 1987 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50- (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
(C) ACCESSION NUMBER: X01638 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1987 



ATGGGTAAAG AGAAGTCTCA CATTAACGTT GTCGTTATCG GTCATGTCGA 50 

TTCTGGTAAG TCTACCACTA CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAA AAGTTCGAAA AGGAAGCCGC TGAATTAGGT 150 

10 AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG 200 

AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 250 

AGTACCAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 300 

AACATGATTA CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 

TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAACCA 4 00 

15 GAGAACACGC TTTGTTGGCT TTCACCTTGG GTGTTAGACA ATTGATTGTT 450 

GCTGTCAACA AGATGGACTC CGTCAAATGG GACGAATCCA GATTCCAAGA 500 

AATTGTCAAG GAAACCTCCA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 

AGACTGTTCC ATTCGTCCCA ATCTCTGGTT GGAACGGTGA CAACATGATT 600 

GAAGCTACCA CCAACGCTCC ATGGTACAAG GGTTGGGAAA AGGAAACCAA 650 

20 GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGCCATT GACGCCATTG 700 

AACAACCATC TAGACCAACT GACAAGCCAT TGAGATTGCC ATTGCAAGAT 750 

GTTTACAAGA TCGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC TTTCGCCCCA GCTGGTGTTA 850 

CCACTGAAGT CAAGTCCGTT GAAATGCATC ACGAACAATT GGAACAAGGT 900 

25 GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 950 

AATCAGAAGA GGTAACGTCT GTGGTGACGC TAAGAACGAT CCACCAAAGG 1000 

GTTGCGCTTC TTTCAACGCT ACCGTCATTG TTTTGAACCA TCCAGGTCAA 1050 

ATCTCTGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 1100 

TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA 1150 

30 AGTTGGAAGA CCATCCAAAG TTCTTGAAGT CCGGTGACGC TGCTTTGGTC 1200 

AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

ACCATTAGGT AGATTCGCTG TCAGAGACAT GAGACAAACT GTCGCTGTCG 1300 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGCCGCTAA GGTTACCAAG 1350 

GCTGCTCAAA AGGCTGCTAA GAAATAA 1377 

35 



2) INFORMATION FOR SEQ ID NO: 1988 



40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
50 (C) ACCESSION NUMBER: M10992 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1988 
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ATGGGTAAAG AGAAGTCTCA CATTAACGTT GTCGTTATCG GTCATGTCGA 50 

TTCTGGTAAG TCTACCACTA CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAA AAGTTCGAAA AGGAAGCCGC TGAATTAGGT 150 

AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG 200 

5 AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 250 

AGTACCAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 300 

AACATGATTA CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 

TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAACCA 400 

GAGAACACGC TTTGTTGGCT TTCACCTTGG GTGTTAGACA ATTGATTGTT 450 

10 GCTGTCAACA AGATGGACTC CGTCAAATGG GACGAATCCA GATTCCAAGA 500 

AATTGTCAAG GAAACCTCCA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 

AGACTGTTCC ATTCGTCCCA ATCTCTGGTT GGAACGGTGA CAACATGATT 600 

GAAGCTACCA CCAACGCTCC ATGGTACAAG GGTTGGGAAA AGGAAACCAA 650 

GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGCCATT GACGCCATTG 700 

15 AACAACCATC TAGACCAACT GACAAGCCAT TGAGATTGCC ATTGCAAGAT 750 

GTTTACAAGA TTGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC TTTTGCCCCA GCTGGTGTTA 850 

CCACTGAAGT CAAGTCCGTT GAAATGCATC ACGAACAATT GGAACAAGGT 900 

GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 950 

20 AATCAGAAGA GGTAACGTCT GTGGTGACGC TAAGAACGAT CCACCAAAGG 1000 

GTTGCGCTTC TTTCAACGCT ACCGTCATTG TTTTGAACCA TCCAGGTCAA 1050 

ATCTCTGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 1100 

TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA 1150 

AGTTGGAAGA CCATCCAAAG TTCTTGAAGT CCGGTGACGC TGCTTTGGTC 1200 

25 AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

ACCATTAGGT AGATTCGCTG TCAGAGACAT GAGACAAACT GTCGCTGTCG 1300 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGCCGCTAA GGTTACCAAG 1350 

GCTGCTCAAA AGGCTGCTAA GAAATAA 1377 



30 



2) INFORMATION FOR SEQ ID NO: 1989 



. (i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eremothecium gossypii 

(B) STRAIN: ATCC 10895 

45 (C) ACCESSION NUMBER: X73978 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1989 

ATGGGTAAGG AAAAGACTCA CGTTAACGTT GTCGTCATCG GTCACGTCGA 50 

50 CTCTGGTAAG TCTACTACCA CCGGTCACTT GATCTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAG AAGTTCGAGA AGGAGGCTGC CGAGTTGGGT 150 

AAGGGTTCTT TCAAGTACGC CTGGGTTTTG GACAAATTGA AGGCTGAGAG 2 00 

AGAGAGAGGT ATCACCATCG ACATTGCGTT GTGGAAGTTC GAGACTCCAA 250 
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AGTACCACGT CACTGTCATT GACGCCCCAG GCCACAGAGA CTTCATCAAG 300 

AACATGATTA CCGGTACTTC TCAAGCTGAC TGTGCCATCT TGATCATTGC 350 

TGGTGGTGTC GGTGAGTTCG AGGCTGGTAT CTCCAAGGAC GGTCAGACCA 400 

GAGAGCACGC TTTGTTGGCT TACACCTTGG GTGTCAAGCA GTTGATCGTT 450 

5 GCCATCAACA AGATGGACTC CGTCAAGTGG GACGAGTCCA GATACCAGGA 500 

GATTGTCAAG GAGACCTCCA ACTTCATCAA GAAGGTCGGT TACAACCCTA 550 

AGACTGTTCC ATTCGTTCCA ATCTCCGGCT GGAACGGTGA CAACATGATT 600 

GAGGCCACCA CCAACGCCCC ATGGTACAAG GGCTGGGAGA AGGAGACCAA 650 

GGCTGGTGCC GTCAAGGGTA AGACCTTGTT GGAGGCCATT GACGCCATTG 700 

LO AGCCACCTGT CAGACCAACT GACAAGGCAT TGAGATTGCC ATTGCAGGAT 750 

GTCTACAAGA TCGGTGGTAT TGGTACGGTT CCAGTCGGCA GAGTCGAGAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC CTTCGCCCCA TCCGGTGTCA 850 

CCACTGAAGT CAAGTCCGTC GAGATGCACC ACGAGCAATT GGAGGAGGGT 900 

GTCCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTCT CCGTCAAGGA 950 

L5 GATCAGAAGA GGTAACGTTT GCGGTGACTC CAAGAACGAC CCACCAAAGG 1000 

CTGCTGAGTC CTTCAACGCT, ACCGTCATTG TCTTGAACCA CCCAGGTCAA 1050 

ATCTCTGCCG GTTACTCTCC AGTCTTGGAC TGTCACACTG CCCACATTGC 1100 

TTGTAAGTTC GACGAGTTGT TGGAGAAGAA CGACAGAAGA ACCGGTAAGA 1150 

AGTTGGAAGA CTCTCCAAAG TTCCTAAAGG CCGGTGACGC TGCCATGGTC 1200 

20 AAGTTTGTCC CATCCAAGCC AATGTGTGTT GAGGCTTTCA CCGACTACCC 1250 

ACCATTGGGT AGATTCGCTG TCAGAGACAT GAGACAGACC GTTGCTGTCG 1300 

GTGTCATCAA GTCTGTTGTC AAGTCCGACA AGGCTGGTAA GGTCACCAAG 1350 

GCCGCCCAAA AGGCTGGTAA GAAATAG 1377 



25 



40 



2) INFORMATION FOR SEQ ID NO: 1990 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eremothecium gossypii 
(C) ACCESSION NUMBER: A29820 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1990 



ATGGGTAAGG AAAAGACTCA CGTTAACGTT GTCGTCATCG GTCACGTCGA 50 

CTCTGGTAAG TCTACTACCA CCGGTCACTT GATCTACAAG TGTGGTGGTA 100 

3 5 TTGACAAGAG AACCATCGAG AAGTTCGAGA AGGAGGCTGC CGAGTTGGGT 150 

AAGGGTTCTT TCAAGTACGC CTGGGTTTTG GACAAATTGA AGGCTGAGAG 200 

AGAGAGAGGT ATCACCATCG ACATTGCGTT GTGGAAGTTC GAGACTCCAA 250 
AGTACCACGT CACTGTCATT GACCCCCCAG GCCACAGAGA CTTCATCAAG ' 300 

AACATGATTA CCGGTACTTC TCAAGCTGAC TGTGCCATCT TGATCATTGC 350 

50 TGGTGGTGTC GGTGAGTTCG AGGCTGGTAT CTCCAAGGAC GGTCAGACCA 400 

GAGAGCACGC TTTGTTGGCT TACACCTTGG GTGTCAAGCA GTTGATCGTT 4 50 

GCCATCAACA AGATGGACTC CGTCAAGTGG GACGAGTCCA GATACCAGGA 500 

GATTGTCAAG GAGACCTCCA ACTTCATCAA GAAGGTCGGT TACAACCCTA 550 
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AGACTGTTCC ATTCGTTCCA ATCTCCGGCT GGAACGGTGA CAACATGATT 600 

GAGGCCACCA CCAACGCCCC ATGGTACAAG GGCTGGGAGA AGGAGACCAA 650 

GGCTGGTGCC GTCAAGGGTA AGACCTTGTT GGAGGCCATT GACGCCATTG 700 

AGCCACCTGT CAGACCAACT GACAAGGCAT TGAGATTGCC ATTGCAGGAT 750 

5 GTCTACAAGA TCGGTGGTAT TGGTACGGTT CCAGTCGGCA GAGTCGAGAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC CTTCGCCCCA TCCGGTGTCA 850 

CCACTGAAGT CAAGTCCGTC GAGATGCACC ACGAGCAATT GGAGGAGGGT 900 

GTCCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTCT CCGTCAAGGA 950 

GATCAGAAGA GGTAACGTTT GCGGTGACTC CAAGAACGAC CCACCAAAGG 1000 

LO CTGCTGAGTC CTTCAACGCT ACCGTCATTG TCTTGAACCA CCCAGGTCAA 1050 

ATCTCTGCCG GTTACTCTCC AGTCTTGGAC TGTCACACTG CCCACATTGC 1100 

TTGTAAGTTC GACGAGTTGT TGGAGAAGAA CGACAGAAGA ACCGGTAAGA 1150 

AGTTGGAAGA CTCTCCAAAG TTCCTAAAGG CCGGTGACGC TGCCATGGTC 1200 

AAGTTTGTCC CATCCAAGCC AATGTGTGTT ' GAGGCTTTCA CCGACTACCC 1250 

L5 ACCATTGGGT AGATTCGCTG TCAGAGACAT GAGACAGACC GTTGCTGTCG 1300 

GTGTCATCAA GTCTGTTGTC AAGTCCGACA AGGCTGGTAA GGTCACCAAG 1350 

GCCGCCCAAA AGGCTGGTAA GAAATAG 1377 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1991 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1646 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Aspergillus oryzae 

(B) STRAIN: KBN616 

(C) ACCESSION NUMBER: AB007770 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1991 



TGGGGTAAGT TTATCAACCC GTCGAGTTGT GTTGCATCTC AGATCATGGC 50 

TGACAAGTAC TTTCCTCCTT ACAGTAAGGA AGACAAGCAG CACATCAACA 100 

4 0 TCGTCGTTAT CGGCCACGTC GATTCCGGCA AGTCCACCAC CACTGGTCAC 150 

TTGATCTACA AGTGTGGTGG TATCGACCAG CGTACCATCG AGAAGTTCGA 200 

GAAGGAAGCC GCTGAGCTCG GTAAGGGTTC CTTCAAGTAC GCCTGGGTTC 250 

TTGACAAGCT CAAGTCCGAG CGTGAGCGTG GTATCACCAT CGATATCGCC 300 

CTCTGGAAGT TCCAGACCTC CAAGTATGAG GTCACCGTCA TTGGTAAGCA 350 

45 TTTGAGTTCC AACCTACGTT GCCCAACATT TACAGTCATC TAACAAAGTT 400 

CAATAGATGC CCCCGGTCAC CGTGACTTCA TCAAGAACAT GATCACTGGT 450 

ACTTCCCAGG CTGACTGCGC TATCCTCATC ATTGCCTCCG GTACTGGTGA 500 

ATTCGAGGCT GGTATCTCCA AGGATGGTCA GACCCGTGAG CACGCTCTGC 550 

TCGCTTTCAC CCTCGGTGTC CGTCAGCTCA TCGTTGCCCT CAACAAGATG 600 

50 GACACCTGCA AGTGGTCTCA GGATCGTTAC AACGAAATCG TTAAGGAGAC 650 

TTCCAACTTC ATCAAGAAGG TCGGATACAA CCCCAAGAGC GTTCCTTTCG 700 

TCCCCATCTC CGGTTTCAAC GGTGACAACA TGATTGAGGC CTCCACCAAC 750 

TGCCCCTGGT ACAAGGGCTG GGAGAAGGAG ACCAAGGCTG GCAAGTCCAC 800 
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CGGTAAGACC 
CCACCGACAA 
GGTATCGGTA 



CTTCTCGAGG 
GCCTCTCCGT 
CTGTGCCCGT 



CCATCGATGC 
CTTCCCCTCC 
CGGTCGTGTC 



CATCGAGCCC 
AGGATGTCTA 
GAGACTGGTG 



CCCGTCCGTC 
CAAGATCTCT 
TCATCAAGCC 



850 
900 
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TGGTATGGTC GTTACTTTCG CTCCTGCCAA CGTGACCACT GAAGTCAAGT 1000 

CCGTTGAAAT GCACCACCAG CAGCTCCAGG CCGGTAACCC CGGTGACAAC 1050 

GTTGGTTTCA ACGTCAAGAA CGTCTCCGTC AAGGAAGTCC GCCGTGGTAA 1100 

CGTTGCCGGT GACTCCAAGA ACGACCCCCC TGCTGGCTGC GATTCCTTCA 1150 

ACGCCCAGGT CATCGTCCTT AACCACCCCG GTCAGGTCGG CAACGGTTAC 1200 

GCTCCCGTCC TGGACTGCCA CACCGCTCAC ATTGCTTGCA AGTTCGCTGA 1250 

GCTCCTTGAG AAGATTGACC GCCGTACCGG TAAATCTGTT GAGGACAAGC 1300 

CCAAGTTCAT CAAGTCTGGT GATGCTGCCA TCGTCAAGAT GATTCCCTCC 1350 

AAGCCCATGT GTGTGGAGTC TTTCACTGAC TTCCCCCCTC TTGGTCGTTT 14 00 

CGCTGTCCGT GACGTAAGTT TTTCCCTCTT GACTATCTTC ACAATTTTTC 14 50 

ACATATTTTC ACGCCTCGTC CCACTCTTTT TCCTCCCTTC CTCTTTGGTT 1500 

CCCCTTTTTG CCTGCAAGTT CTCTATAGCT AACATGATGT CTAGATGCGT 1550 

CAAACTGTTG CCGTCGGAGT, TATCAAGTCG GTTGAGAAGA ACACTGGCGG 1600 

TTCTGGCAAG GTCACCAAGG CCGCCCAGAA GGCTGGCAAG AAATAA 1646 



2) INFORMATION FOR SEQ ID NO: 1992 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1380 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aureobasidium pullulans 

(B) STRAIN: R106 

(C) ACCESSION NUMBER: U19723 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1992 

ATGGGTAAGG AAAAGTCCCA CATCAACGTC GTCGTTATCG GCCACGTCGA 50 

CTCCGGTAAG TCGACCACCA CCGGTCACTT GATCTACAAG TGCGGTGGTA 100 

TCGACAAGCG TACCATCGAG AAGTTCGAGA AGGAAGCCGC CGAACTCGGC 150 

AAGGGTTCCT TCAAGTACGC CTGGGTCCTC GACAAGCTGA AGTCTGAGCG 200 

TGAGCGTGGT ATCACTATCG ATATCGCTCT GTGGAAGTTC GAGACCCCCA 250 

AGTACATGGT CACCGTCATC GATGCCCCCG GTCACCGTGA TTTCATCAAG 300 

AACATGATCA CTGGTACCTC CCAGGCTGAC TGCGCCATTC TCATCATTGC 350 

CGCCGGTACT GGTGAGTTCG AGGCTGGTAT CTCCAAGGAT GGCCAGACTC 4 00 

GTGAGCACGC CCTTCTCGCC TACACCCTTG GTGTCAAGCA GCTCATCGTC 4 50 

GCTATCAACA AGATGGACAC CACCAAGTGG TCTGAGGCCC GTTACCAGGA 500 

GATCATCAAG GAGACCTCCG GTTTCATCAA GAAGGTCGGC TACAACCCCA 550 

AGCACGTTCC CTTTGTCCCC ATCTCGGGTT TCAACGGTGA CAACATGATT 600 

GAGGTTTCTT CCAACTGCCC CTGGTACAAG GGTTGGGAGA AGGAGACCAA 650 

GGCCAAGGCC ACTGGTAAGA CTCTCCTCGA GGCCATTGAC GCCATCGACC 700 

CTCCTTCGCG CCCCACCGAC AAGCCCCTCC GTCTTCCCCT CCAGGATGTC 750 

TACAAGATCG GTGGTATTGG CACGGTGCCC GTCGGCCGTG TCGAGACCGG 800 
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TACCATCAAG 
CTGAGGTCAA 
CCCGGTGACA 



GGTGGTATGG 
GTCCGTCGAG 
ACGTCGGCTT 



TCGTCACCTT 
ATGCACCACG 
CAACGTCAAG 



CGCCCCCGCT GGTGTCACCA 
AGCAGCTCTC CGAGGGTCTC 
AACGTCTCCG TCAAGGAGAT 



850 
900 
950 



10 



15 



20 



. 25 



30 



35 



40 



45 



CCGTCGTGGT AACGTTGCCG GTGACTCCAA GAACGACCCC CCCAAGGGTT 1000 

GCGACTCCTT CAACGCCCAG GTCATCGTCC TCAACCACCC CGGTCAGGTC 1050 

GGTGCTGGTT ACGCACCCGT CCTCGATTGC CACACTGCCC ACATCGCCTG 1100 

CAAGTTCTCC GAGCTTGTTG AGAAGATTGA CCGCCGTACC GGCAAGTCCG 1150 

TTGAGGCCGC CCCCAAGTTC ATCAAGTCTG GTGACGCCGC CATCGTCAAG 1200 

ATGGTTCCCT CCAAGCCTAT GTGTGTTGAG GCCTTCACTG ACTACCCTCC 1250 

TCTCGGTCGT TTCGCCGTCC GTGACATGAG ACAGACCGTC GCTGTCGGTG 1300 

TCATCAAGTC CGTCGCCAAG TCCGACAAGC AGGGTGCCGG TAAGGTTACC 1350 

AAGGCCGCTG TCAAGGCTGG CAAGAAGTAA 1380 



2) INFORMATION FOR SEQ ID NO: 1993 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulation 

(B) STRAIN: 186AS 

(C) ACCESSION NUMBER: U14100 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1993 

ATGGGTAAGG AAGACAAGAC TCACATTAAC CTCGTCGTCA TCGGCCACGT 50 

CGATTCGGGC AAATCTACCA CCACTGGTCA TTTGATCTAC AAATGCGGTG 100 

GTATTGACAG CCGTACCATT GAGAAGTTCG AAAAGGAAGC CGAAGAGTTG 150 

GGCAAGAAAT CCTTCAAATA TGCGTGGGTC CTTGACAAAC TGAAGTCTGA 200 

GCGTGAGCGT GGTATCACCA TCGATATTGC CCTCTGGAAA TTCGAGACTC 250 

CGAAGTACAG TGTCACTGTC ATTGATGCTC CCGGCCATCG TGACTTCATC 300 

AAGAACATGA TCACTGGTAC CTCCCAGGCT GACTGCGCTA TCCTCATCAT 350 

TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CTCGTGAGCA CGCTCTGCTT GCTTTCACCC TTGGTGTGAG GCAACTCATC 450 

GTTGCCATCA ACAAGATGGA CACCACCAAG TGGTCCGAGT CCCGTTTCAA 500 

CGAAATCATC AAGGAGGTTT CCAACTTCAT CAAGAAGGTC GGATATAACC 550 

CCAAGGCTGT TCCCTTCGTG CCAATCTCTG GTTTCGAGGG TGACAACATG 600 

ATTGAACCCT CCCCCAACTG CACATGGTAC AAGGGCTGGA ACAAGGAGAC 650 

TGCCTCTGGC AAGTCTTCTG GTAAAACCCT TCTCGATGCC ATTGACGCCA 700 

TTGAACCCCC AACCCGTCCT ACCGATAAGC CCCTCCGTCT TCCCCTCCAG 750 

GATGTGTACA AAATCTCTGG TATTGGCACT GTTCCCGTCG GACGTGTTGA 800 

GACTGGTGTC ATCAAGCCCG GTATGGTCGT GACTTTCGCT CCCTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAAATGC ACCACCAACA ACTCCAGGCT 900 

GGTTACCCTG GCGACAACGT CGGCTTCAAC GTCAAGAACG TTTCAGTCAA 950 

GGAAGTCCGC CGTGGCAACG TTGCTGGCGA CTCCAAAAAT GATCCCCCTA 1000 

AGGGCTGCGA ATCCTTCAAT GCCCAGGTCA TCGTCCTTAA CCACCCCGGC 1050 
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CTGCCCACAT 1100 

CGTACCGGAA 1150 

TGCTGCTATC 1200 

TCACTGACTA 1250 

ACCGTCGCTG 1300 

TGGCAAGGTC 1350 
1383 



LO 



2) INFORMATION FOR SEQ ID NO: 1994 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1383 bases 
L5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neurospora crassa 
(C) ACCESSION NUMBER: D45837 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1994 

ATGGGCAAGG AGGACAAGAC TCACATCAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCTACCA CTACCGGTCA CTTGATCTAC AAGTGCGGTG 100 

GTATCGACAA GCGTACCATC GAGAAGTTCG AGAAGGAAGC CGCTGAGCTC 150 

30 GGTAAGGGTT CCTTCAAGTA TGCCTGGGTT CTTGACAAGC TCAAGGCCGA 200 

GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG TTCGAGACTC 250 

CCAAGTACTA CGTCACCGTC ATCGATGCCC CCGGTCATCG TGATTTCATC 300 

AAGAACATGA TCACTGGTAC CTCCCAGGCT GATTGCGCTA TCCTCATCAT 350 

TGCCGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

35 CCCGTGAGCA CGCCCTGCTC GCCTACACCC TCGGTGTCAA GCAGCTCATT 450 

GTTGCCATCA ACAAGATGGA CACCACCCAG TGGTCCCAGA CTCGTTTCGA 500 

GGAGATCATC AAGGAGACCA AGAACTTCAT CAAGAAGGTT GGCTACAACC 550 

CCGCTGGTGT CGCTTTCGTC CCCATCTCCG GCTTCAACGG CGACAACATG 600 

CTTGAGCCCT CCACCAACTG CCCCTGGTAC AAGGGTTGGG AGAAGGAGAC 650 

JO CAAGGCCGGC AAGGCCACTG GCAAGACCCT CCTCGAGGCC ATCGACGCCA 700 

TTGAGCCCCC CAAGCGTCCT ACCGACAAGC CCCTCCGTCT TCCCCTCCAG 750 

GATGTCTACA AGATCGGTGG TATCGGCACA GTGCCCGTCG GCCGTATCGA 800 

GACTGGTGTC CTCAAGCCCG GTATGGTCGT TACCTTCGCT CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTTGCTCAG 900 

15 GGTGTCCCCG GTGACAACGT CGGCTTCAAC GTGAAGAACG TTTCCGTCAA 950 

GGATATCCGC CGTGGTAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCTG 1000 

CTGGCGCCGC CTCTTTCACC GCCCAGGTCA TCGTTCTCAA CCACCCCGGT 1050 

CAGGTCGGTG CCCGCTACGC CCCCGTCCTC GACTGCCACA CTGCCCACAT 1100 

TGCCTGCAAG TTCGCCGAGC TCCTCGAGAA GATCGACCGC CGTACTGGTA 1150 

50 AGGCTGTTGA GGCCTCCCCC AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

GTCAAGATGA TTCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACCGACTA 1250 

CCCTCCCCTC GGCCGTTTCG CCGTCCGTGA CATGCGTCAG ACCGTCGCCG 1300 

TCGGTGTCAT CAAGGCCGTC GACAAGTCCA CCGCTGCCGC TGGCAAGGTC 1350 
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5 2) INFORMATION FOR SEQ ID NO: 1995 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 
LO (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

L5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Podospora anserina 
(C) ACCESSION NUMBER: X74799 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1995 

20 

ATGGGCAAGG AGGACAAGAC TCACATCAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCGACCA CCACTGGTCA CTTGATCTAC AAGTGCGGTG 100 

GTATTGACAA GCGTACCATC GAGAAGTTCG AGAAGGAAGC TGCTGAGCTC 150 

GGCAAGGGCT CTTTCAAGTA TGCCTGGGTT CTTGACAAGT TGAAGGCCGA 200 

25 GCGTGAGCGT GGTATCACCA TCGATATTGC CCTCTGGAAG TTCGAGACCC 250 

CCAAGTACTA TGTCACCGTC ATTGATGCCC CCGGCCATCG TGATTTCATC 300 

AAGAACATGA TTACTGGTAC TTCCCAGGCC GATTGCGCCA TTCTCATCAT 350 

TGCCGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 4 00 

CCCGTGAGCA CGCTCTCCTC GCCTACACCC TCGGTGTGAA GCAGCTCATC 4 50 

30 GTCGCCATCA ACAAGATGGA CACCACCAAG TGGTCCGAGG CCCGCTTCAA 500 

C GAG AT CATC AAGGAGACCT CCAACTTCAT CAAGAAGGTC GGCTACAACC 550 

CCAAGACTGT TGCCTTCGTC CCCATCTCCG GTTTCAACGG CGACAACATG 600 

CTTGAGGCTT CCACCAACTG CCCCTGGTAC AAGGGCTGGG AGAAGGAGGT 650 

CAAGGGTGGC AAGGCCACCG GCAAGACCCT CCTTGAGGCC ATCGACTCCA 700 

35 TCGAGCCCCC CAAGCGTCCC ACCGACAAGC CCCTCCGTCT TCCCCTCCAG 7 50 

GATGTCTACA AGATCGGCGG TATCGGCACA GTCCCTGTCG GCCGTATCGA 800 

GACTGGTATC CTCAAGCCCG GTATGGTCGT TACCTTCGCT CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTCGCTGAG 900 

GGTGTTCCCG GTGACAACGT TGGTTTCAAC GTGAAGAACG TCTCCGTCAA 950 

10 GGAAATCCGC CGTGGCAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCCA 1000 

TGGGCGCCGC CTCTTTCGAT GCCCAGGTCA TCGTCCTCAA CCACCCCGGC 1050 

CAGGTCGGTG CTGGTTACGC CCCCGTCCTC GATTGCCACA CTGCCCACAT 1100 

CGCCTGCAAG TTCTCTGAGC TCCTGCAGAA GATCGACCGC CGTACTGGTA 1150 

AGGCCGTTGA GGAGAGCCCC AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

45 GTCAAGATGG TTCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACTGAGTA 1250 

CCCTCCCCTC GGTCGTTTCG CCGTCCGTGA CATGCGTCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGAAGGTC GAGAAGGCCG CTGCTGGTTC CGGCAAGGTT 1350 

ACCAAGTCCG CTGCCAAGGC TGGCAAGAAA TAA 1383 

50 

2) INFORMATION FOR SEQ ID NO: 1996 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1386 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Podospora curvicolla 

<B) STRAIN: VLV 

(C) ACCESSION NUMBER: X96614 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1996 

ATGGGCAAGG AGGACAAGAC TCACATCAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCGACCA CCACTGGTCA CTTGATCTAC AAGTGCGGTG 100 

GTATTGACAA GCGTACCATC GAGAAGTTCG AGAAGGAAGC TGCTGAGCTC 150 

GGCAAGGGCT CTTTCAAGTA TGCCTGGGTT CTTGACAAGT TGAAGGCCGA 200 

GCGTGAGCGT GGTATCACCA TTGATATCGC CCTCTGGAAG TTCGAGACCC 250 

CCAAGTACTA TGTCACCGTC ATCGATGCCC CCGGCCATCG TGATTTCATC 300 

AAGAACATGA TTACTGGTAC TTCCCAGGCC GATTGCGCCA TTCTCATCAT 350 

TGCCGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CCCGTGAGCA CGCTCTCCTC GCCTACACCC TCGGTGTGAA GCAGCTCATC 450 

GTCGCCATCA ACAAGATGGA CACCACCAAA TGGTCCGAGG CCCGCTTCAA 500 

CGAGATCATC AAGGAGACCT CCAACTTCAT CAAGAAGGTC GGCTACAACC 550 

CCAAGACTGT TGCCTTCGTC CCCATCTCCG GTTTCAACGG CGACAACATG 600 

CTTGAGGCTT CCACCAACTG CCCCTGGTAC AAGGGTTGGG AGAAGGAGGT 650 

CAAGGGTGGC AAGGCTACTG GCAAGACCCT CCTCGAGGCC ATCGACTCCA 700 

TCGAGCCCCC CAAGCGTCCC ACCGACAAGC CCCTCCGTCT TCCCCTTCAG 750 

GACGTTTACA AGATCGGCGG TATCGGCACA GTCCCTGTCG GCCGTATCGA 800 

GACTGGTATC CTCAAGCCCG GTATGGTCGT TACCTTCGCC CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTCTCTGAG 900 

GGTGTCCCCG GTGACAACGT TGGTTTCAAC GTGAAGAACG TCTCCGTCAA 950 

GGAAATCCGC CGTGGCAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCTC 1000 

TTGGCGCCGC TTCTTTCGAT GCCCAGGTCA TCGTCCTCAA CCACCCCGGC 1050 

CAGGTCGGTG CTGGTTACGC CCCCGTCCTC GATTGCCACA CTGCCCACAT 1100 

CGCCTGCAAG TTCGCTGAGC TCCTGCAGAA GATCGATCGC CGTACTGGTA 1150 

AGGCTGTTGA GGAGAGCCCT AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

GTCAAGATGA TTCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACTGAGTA 1250 

CCCTCCCCTC GGTCGTTTCG CTGTCCGTGA CATGCGTCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGAAGGTC GAGAAGGCCG CTGCTGGTTC CGGCAAGGTC 1350 

ACCAAGTCCG CTGCCAAGGC TGGTGGCAAG AAATAA 1386 



2) INFORMATION FOR SEQ ID NO: 1997 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sordaria macrospora 

(B) STRAIN: OOO 

(C) ACCESSION NUMBER: X96615 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1997 

ATGGGTAAGG AAGACAAGGC TCACATCAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCCACCA CTACCGGTCA CCTGATCTAC AAGTGCGGTG 100 

GTATCGACAA GCGTACCATC GAGAAGTTCG ( AGAAGGAAGC CGCTGAGCTC 150 

GGCAAGGGTT CCTTCAAGTA TGCCTGGGTT CTTGACAAGC TCAAGGCCGA 200 

GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG TTCGAGACTC 250 

CCAAGTACTA CGTCACCGTC ATCGATGCCC CCGGCCATCG TGATTTCATC 300 

AAGAACATGA TCACTGGTAC CTCCCAGGCT GATTGCGCTA TTCTCATCAT 350 

TGCCGCTGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CTCGTGAGCA. CGCTCTTCTC GCCTACACCC TCGGTGTCAA GCAGCTCATC 450 

GTTGCCATCA ACAAGATGGA CACCACCCAG TGGTCCCAGG CTCGTTTCGA 500 

GGAGATCATC AAGGAGACCA AGAACTTCAT CAAGAAGGTC GGCTACAACC 550 

CCGCCACCGT CGCTTTCGTC CCCATCTCCG GCTTCAACGG CGACAACATG 600 

CTTGAGGCCT CCACCAACTG CCCCTGGTAC AAGGGTTGGG AGAAGGAGAC 650 

CAAGGCCGGC AAGTCCACTG GCAAGACCCT CCTCGAGGCC ATCGACGCCA 700 

TTGAGCAGCC CAAGCGCCCG ACCGACAAGC CCCTCCGTCT TCCCCTCCAG 750 

GATGTCTACA AGATCGGCGG TATCGGCACA GTGCCCGTCG GCCGTATCGA 800 

GACTGGTGTC CTCAAGCCCG GTATGGTCGT TACCTTCGCT CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTTGCTCAG 900 

GGTGTTCCCG GTGACAACGT CGGCTTCAAC GTGAAGAACG TTTCCGTCAA 950 

GGATATCCGT CGTGGTAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCTG 1000 

TCGGCGCTGC CTCTTTCACC GCCCAGGTCA TCGTCCTTAA CCACCCCGGT 1050 

CAGGTCGGTG CCGGCTACGC TCCCGTCCTC GATTGCCACA CTGCCCACAT 1100 

TGCCTGCAAG TTCGCCGAGC TCCTCGAGAA GATCGATCGC CGTACTGGTA 1150 

AGGCTGTTGA GACTTCTCCC AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

GTCAAGATGA TTCCCTCCAA GCCCATGTGC GTCGAGGCTT TCACCGACTA 1250 

CCCTCCCCTC GGTCGTTTCG CCGTCCGTGA CATGCGTCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGGCCGTC GACAAGACCC AGGCTGTCGC TGGCAAGGTC 1350 

ACCAAGTCTG CTGCCAAGGC TGCCAAGAAG TAA 1383 



2) INFORMATION FOR SEQ ID NO: 1998 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Trichoderma reesei 
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35 



(B) STRAIN: QM9414 

(C) ACCESSION NUMBER: Z23012 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1998 



ATGGGTAAGG AGGACAAGAC TCACATCAAC GTGGTCGTCA TCGGCCACGT 50 

CGACTCCGGC AAGTCTACCA CCACTGGTCA CTTGATCTAC CAGTGCGGTG 100 

GTATCGACAA GCGTACCATT GAGAAGTTCG AGAAGGAAGC CGCCGAACTC 150 

GGCAAGGGTT CCTTCAAGTA CGCGTGGGTT CTTGACAAGC TCAAGGCCGA 200 

10 GCGTGAGCGT GGTATCACCA TCGACATTGC CCTCTGGAAG TTCGAGACTC 250 

CCAAGTACTA TGTCACCGTC ATTGACGCTC CCGGCCACCG TGACTTCATC 300 

AAGAACATGA TCACTGGTAC TTCCCAGGCC GACTGCGCTA TCCTCATCAT 350 

CGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CCCGTGAGCA CGCTCTGCTC GCCTACACCC TGGGTGTCAA GCAGCTCATC 450 

15 GTCGCCATCA ACAAGATGGA CACTGCCAAC TGGGCCGAGG CTCGTTACCA 500 

GGAAATCATC AAGGAGACTT CCAACTTCAT CAAGAAGGTC GGCTTCAACC 550 

CCAAGGCCGT TGCTTTCGTC CCCATCTCCG GCTTCAACGG TGACAACATG 600 

CTCACCCCCT CCACCAACTG CCCCTGGTAC AAGGGCTGGG AGAAGGAGAC 650 

CAAGGCTGGC AAGTTCACCG GCAAGACCCT CCTTGAGGCC ATCGACTCCA 700 

20 TCGAGCCCCC CAAGCGTCCC ACGGACAAGC CCCTGCGTCT TCCCCTCCAG 750 

GACGTCTACA AGATCGGTGG TATCGGAACA GTTCCCGTCG GCCGTATCGA 800 

GACTGGTGTC CTCAAGCCCG GTATGGTCGT TACCTTCGCT CCCTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTCGCTGAG 900 

GGCCAGCCTG GTGACAACGT TGGTTTCAAC GTGAAGAACG TTTCCGTCAA 950 

25 GGAAATCCGC CGTGGCAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCCA 1000 

TGGGCGCCGC TTCTTTCACC GCCCAGGTCA TCGTCATGAA CCACCCCGGC 1050 

CAGGTCGGTG CCGGCTACGC CCCCGTCCTC GACTGCCACA CTGCCCACAT 1100 

TGCCTGCAAG TTCGCCGAGC TCCTCGAGAA GATCGACCGC CGTACCGGTA 1150 

AGGCTACCGA GTCTGCCCCC AAGTTCATCA AGTCTGGTGA CTCCGCCATC 1200 

30 GTCAAGATGA TCCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACCGACTA 1250 

CCCTCCCCTG GGTCGTTTCG CCGTCCGTGA CATGCGCCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGGCCGTC GAGAAGTCCT CTGCCGCCGC CGCCAAGGTC 1350 

ACCAAGTCCG CTGCCAAGGC CGCCAAGAAA TAA 1383 



2) INFORMATION FOR SEQ ID NO: 1999 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1999 
CATGTCAAYA TTGGTACTAT TGGTCATGT 29 

50 

2) INFORMATION FOR SEQ ID NO: 2000 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2000 
CCACCYTCIC TCAMGTTGAA RCGTT 



2) INFORMATION FOR SEQ ID NO: 2001 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2001 
ACYACITTRA CIGCYGCYAT YAC 



2) INFORMATION FOR SEQ ID NO: 2002 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2002 
CCIGARGARA GAGCIMGWGG T 



2) INFORMATION FOR SEQ ID NO: 2003 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2003 

CATYTCRAIR TTGTCACCTG G 21 
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2) INFORMATION FOR SEQ ID NO: 2004 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1360 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: SC5314 

(C) ACCESSION NUMBER: Genome project 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2004 

GCTGCCTTCG ACCGTTCTAA ACCTCATGTC AACATTGGTA CTATTGGTCA 50 

TGTTGATCAT GGTAAAACTA CATTGACTGC TGCTATCACC AAAGTTTTAG 100 

CCGAACAAGG TGGTGCCAAC TTCTTGGATT ATGGTTCTAT TGATAGAGCT 150 

CCAGAAGAAA GAGCTAGAGG TATCACTATT TCCACTGCCC ACGTTGAATA 200 

CGAAACCAAG AACAGACACT ATGCCCACGT TGATTGTCCA GGACACGCTG 250 

ATTATATCAA AAATATGATT ACTGGTGCCG CTCAAATGGA TGGTGCTATC 300 

ATTGTTGTTG CTGCCACTGA TGGTCAAATG CCTCAAACCA GAGAACATTT 350 

GTTATTGGCC AGACAAGTTG GTGTTCAAGA CTTGGTTGTG TTTGTCAACA 4 00 

AAGTCGATAC TATTGATGAC CCTGAAATGT TGGAATTAGT CGAAATGGAA 450 

ATGAGAGAAT TGTTATCCAC CTACGGTTTT GATGGTGACA ACACTCCAGT 500 

TATTATGGGA TCTGCTTTAA TGGCTTTGGA AGACAAGAAA CCAGAAATTG 550 

GTAAGGAAGC TATCTTGAAA TTGTTAGATG CTGTCGATGA ACACATTCCA 6 00 

ACTCCATCAA GAGACTTGGA ACAACCATTT TTGTTACCAG TTGAAGACGT 650 

GTTCTCCATC TCCGGTAGAG GAACTGTTGT CACTGGTAGA GTTGAAAGAG 7 00 

GTGTTTTGAA GAAGGGTGAA GAAATCGAAA TTGTTGGTGG TTTTGACAAA 750 

CCTTACAAGA CTACTGTTAC CGGTATTGAA ATGTTCAAAA AAGAATTAGA 8 00 

CTCTGCTATG GCTGGTGACA ACTGTGGTGT TTTGTTAAGA GGTGTTAAAA 8 50 

GAGATGAAAT CAAGAGAGGT ATGGTTTTGG CCAAACCAGG TACTGCTACT 900 

TCTCACAAGA AGTTCTTGGC TTCCTTGTAT ATTTTGACTT CCGAAGAAGG 950 

TGGTCGTTCC ACT CC ATTTG GTGAAGGTTA CAAGCCTCAA TGCTTCTTCA 1000 

GAACTAACGA TGTCACTACC ACATTTTCAT TCCCAGAAGG AGAAGGTGTT 1050 

GATCATTCTC AAATGATCAT GCCAGGTGAC AACATTGAAA TGGTTGGTGA 1100 

ATTGATCAAA TCTTGTCCAT TAGAAGTCAA CCAACGTTTC AACTTGAGAG 1150 

AAGGTGGTAA AACTGTTGGT ACTGGTTTGA TTACCAGAAT CATCGAATAA 1200 

ACAGAATGTG CACTGTGAAT AATAAAAAGA AAAGAGGTAT ATATAGGTGA 1250 

CTTTGTATTT TGTATTGAAC AATAAAATTC TGTAAATAGT AAGGGCCTCA 13 00 

GAAGTTTTGA TTTGATTTAT GCCATGTGGA CTTGTAGAGA TATCCTTCTC 1350 

AAACTTCTTG 13 60 



2) INFORMATION FOR SEQ ID NO: 2005 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1342 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Schizosaccharomyces pombe 
(C) ACCESSION NUMBER: Genome project 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2005 

AAGCCGCATG TCAATATTGG TACTATTGGT CATGTTGACC ACGGTAAAAC 50 

GACGTTGACG GCTGCTATTA CTAAATGCCT TTCTGATCTT GGTCAAGCTA 100 

GTTTTATGGA TTATAGTCAA ATTGACAAGG CCCCCGAGGA AAAGGCACGT 150 

GGTATTACCA TTTCATCTGC CCATGTTGAA TACGAAACTG CTAATCGTCA 200 

CTATGCCCAT GTGGATTGTC CTGGTCACGC CGATTACATT AAGAATATGA 250 

TTACTGGTGC TGCTACAATG GATGGCGCTA TCATTGTTGT TTCTGCTACC 300 

GATGGTCAAA TGCCTCAAAC TCGTGAACAT TTGCTTCTGG CTCGTCAAGT 350 

CGGTGTAAAG CAAATTGTTG TATACATCAA TAAAGTCGAT ATGGTCGAGC 4 00 

CTGATATGAT CGAGCTTGTC GAAATGGAAA TGCGTGAGCT ACTCTCCGAA 4 50 

TACGGATTTG ATGGTGACAA TACTCCAATT GTTAGCGGCA GTGCTTTATG 500 

TGCCTTAGAG GGTCGTGAGC CTGAGATTGG TCTCAATAGT ATTACTAAAT 550 

TGATGGAAGC TGTTGATAGT TATATTACTC TTCCTGAAAG AAAAACGGAT 600 

GTCCCTTTCT TGATGGCCAT CGAGGACGTT TTTTCAATTT CAGGTCGCGG 650 

AACTGTAGTC ACTGGCCGTG TCGAGCGCGG TACTTTAAAG AAGGGTGCTG 700 

AAATCGAAAT CGTCGGTTAT GGTAGCCATT TAAAGACTAC CGTTACTGGA 750 

ATTGAAATGT TCAAAAAGCA GCTTGATGCC GCCGTTGCCG GTGACAATTG 800 

TGGCCTTTTA CTTCGTTCTA TCAAGCGAGA GCAATTAAAA CGTGGAATGA 850 

TTGTCGCTCA ACCAGGAACC GTTGCTCCTC ATCAGAAATT CAAGGCATCA 9 00 

TTCTATATTT TGACAAAAGA GGAAGGAGGT CGTCGTACCG GTTTCGTTGA 950 

CAAGTATCGT CCCCAACTGT ACAGTCGTAC TTCCGACGTT ACTGTCGAAC 1000 

TTACCCACCC TGATCCTAAC GACTCAGACA AAATGGTTAT GCCTGGAGAC 1050 

AATGTCGAGA TGATCTGTAC GCTTATTCAC CCCATTGTCA TCGAAAAAGG 1100 

ACAACGCTTC ACAGTTCGTG AGGGTGGAAG CACTGTAGGC ACAGCTTTGG 1150 

TTACTGAACT TTTGGATTAG TGCATTTATG AACTTATTGG CTTTAAAAAT 1200 

TTTGCATGCT GAATACCAAT ATTATGTCCC TTCTCAGAAT TCTATAACTA 1250 

CAGTGTCATT ATTGTAATAA GACTTTTGCA TCCATTGACA ATGGTATTTG 1300 

ATACTTTTAT AGTTTCTACT ATTGTTAGCC AAAGTTATAA AA 1342 



2) INFORMATION FOR SEQ ID NO: 2006 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2006 
TGGAGCCGGT GAGCGTGG 



2) INFORMATION FOR SEQ ID NO: 2007 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2007 
TGGAGCCAGT GAGCGTGG 



2) INFORMATION FOR SEQ ID NO: 2008 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2008 
TCTGGAGCCG ATGAGCGTG 



2) INFORMATION FOR SEQ ID NO: 2009 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2009 
CTGGAGCCAG TAAGCGTGG 



2) INFORMATION FOR SEQ ID NO: 2010 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

1046 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: KMK107 

(C) ACCESSION NUMBER: AF027199 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2010 

ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTAAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 4 00 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 4 50 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGACG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 6 00 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCAGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGC CAGAT 750 

GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



2) INFORMATION FOR SEQ ID NO: 2011 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: CLSiS L-491 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2011 

ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCTTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAGCT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 3 00 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 4 50 

CACAACATGG GGGATCATGT AACCCGCCTT GATCGTTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 
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TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 
ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 
GAGCCAGTAA GCGTGGATCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 
GGTAAGCCCT CCCGTATCGT AGTTATCTAC ATGACGGGGA GTCAGGCAAC 
TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 
AGCATTGGTA A 



650 
700 
750 
800 
850 
861 



2) INFORMATION FOR SEQ ID NO: 2012 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single' 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2012 
CCGCGGATTA TTAAACCGCC CTTCCGCGG-MR-HEG-ATGTCAGAGG GATAGATCCA 4 9 



2) INFORMATION FOR SEQ ID NO: 2013 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorbata 

(B) STRAIN: ATCC 33433 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2013 

AGCTTAAGAA CTCTTATCTG GATTACGCGA TGTCGGTCAT TGTTGGCCGT 50 

GCGCTGCCGG ATGTCCGAGA TGGCCTGAAG CCGGTACACC GTCGCGTACT 100 

TTACGCCATG AACGTATTGG GCAATGACTG GAACAAAGCC TACAAAAAAT 150 

CAGCCCGTGT CGTGGGTGAC GTGATCGGTA AATATCACCC GCATGGTGAT 200 

ACTGCCGTCT ATGACACTAT CGTCCGTATG GC ACAGC CAT TCTCACTGCG 250 

ATACATGCTG GTAGATGGTC AAGGTAACTT CGGTTCTGTC GATGGCGACT 300 

CCGCCGCAGC GATGCGTTAT ACGGAAATCC GTATGTCGAA AATCGCCCAT 350 

GAG 353 



2) INFORMATION FOR SEQ ID NO: 2014 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS:* Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera georgiana 

(B) STRAIN: ATCC 51603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2014 

AGCTCCTATC TGGATTATGC GATGTCGGTC ATTGTTGGCC GTGCGCTGCC 50 

AGATGTCCGA GATGGCCTGA AGCCGGTACA CCGTCGCGTA CTTTACGCCA 100 

TGAACGTACT AGGCAATGAC TGGAACAAAG CCTATAAAAA ATCTGCCCGT 150 

GTCGTTGGTG ACGTAATCGG TAAATACCAT CCCCATGGTG ACTCGGCGGT 200 

CTATGACACG ATCGTCCGCA TGGCGCAGCC ATTCTCGCTG CGTTATATGC 250 

TGGTAGACGG TCAGGGTAAC TTCGGTTCTA TCGACGGCGA CTCTGCGGCG 300 

GCAATGCGTT ATACGGAAAT CCGTCTGGCG AAAATTGCCC ATGAACTGAT 350 

GGCCG 355 



2) INFORMATION FOR SEQ ID NO: 2015 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2015 
CCAAGAAGCT CAAAAACATC TG 



2) INFORMATION FOR SEQ ID NO: 2016 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid . 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2016 
TADCCTGTCC AWACAGCCAT 



2) INFORMATION FOR SEQ ID NO: 2017 

<i) SEQUENCE CHARACTERISTICS: 

1049 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2017 
ACTTTGAATA AGGTCGGTCT AG 22 



2) INFORMATION FOR SEQ ID NO: 2018 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2018 
ACACTAAACA AGGTTGGTTT AG 22 



2) INFORMATION FOR SEQ ID NO: 2019 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2019 
ACACTAAACA AGGTCGGTCT AG 22 



2) INFORMATION FOR SEQ ID NO: 2020 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2020 
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2) INFORMATION FOR SEQ ID NO: 2021 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2021 
GTAGCTCCAG ACGAAATGTT TG 



2) INFORMATION FOR SEQ ID NO: 2022 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2022 
GTAGCTCCAG ATGAAACGTT TG 



2) INFORMATION FOR SEQ ID NO: 2023 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2023 
GTAACTCCAG ATGAAATGTT TG 



2) INFORMATION FOR SEQ ID NO: 2024 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2024 
AGTGAAAAGA TGGCTGCTGC 



2) INFORMATION FOR SEQ ID NO: 2025 

(i) SEQUENCE CHARACTERISTICS:, 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2025 
AGTGAGAAAA TGGCTGCTGC 



2) INFORMATION FOR SEQ ID NO: 2026 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2026 
TCCAAGCATG CATTATGCAA ACG 



2) INFORMATION FOR SEQ ID NO: 2027 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2027 
TCGGTCTAGA TAGAGCTAAA ACG 

1052 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



2) INFORMATION FOR SEQ ID NO: 2028 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2028 
TATGCTCTTC AACAATCACG 



2) INFORMATION FOR SEQ ID NO: 2029 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2029 
AGCCGTTGAG ACTTTGAATA AG 



2) INFORMATION FOR SEQ ID NO: 2030 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2030 
CTTAATGGTC TTGGTATCG 



2) INFORMATION FOR SEQ ID NO: 2031 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2031 
CGTGACTGGG GTTCTGCTAT GA 



2) INFORMATION FOR SEQ ID NO: 2032 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 22 bases 

(B) TYPE: . Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2032 
CGTGACTGGG GATCATCAAT GA 



2) INFORMATION FOR SEQ ID NO: 2033 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2033 
CGTGACTGGG GTTCTGCCAT GA 



2) INFORMATION FOR SEQ ID NO: 2034 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2034 
ATCAAGAACA CTGGCTATGT AG 



1054 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 2035 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION:' SEQ ID NO: 2035 
ATCAAGAACA CTGGCTACGT AG 22 



2) INFORMATION FOR SEQ ID NO: 2036 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2036 
ATCAAGAACA CTGGTTACGT AG 22 



2) INFORMATION FOR SEQ ID NO: 2037 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2037 
ATCAAAAATA CTGGTTATGT AG 22 



2) INFORMATION FOR SEQ ID NO: 2038 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2038 

ATCAAGAATA CTGGCTACGT AG 22 
2) INFORMATION FOR SEQ ID NO: 2039 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2039 
ATCAAAAACA CTGGCTATGT AG 22 



2) INFORMATION FOR SEQ ID NO: 2040 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2040 
TGTGACCCCA GACAAACCC 19 



2) INFORMATION FOR SEQ ID NO: 2041 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2041 
GTTGAGCGGC AGCACTATCT 20 



2) INFORMATION FOR SEQ ID NO: 2042 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2042 
CACGGGGATT TCTCTATTTA 20 



2) INFORMATION FOR SEQ ID NO: 2043 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2043 
CACGGGGATT ACTCTATTTA 20 



2) INFORMATION FOR SEQ ID NO: 2044 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2044 
ACCGTAAGTC GGCCAAGTCA 20 



2) INFORMATION FOR SEQ ID NO: 2045 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2045 
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GTTCTTTCTC CGTATCGTC 



2) INFORMATION FOR SEQ ID NO: 2046 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2046 
ACGGGGATTT TTCTATCTAT 



2) INFORMATION FOR SEQ ID NO: 2047 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: CS109 

(C) ACCESSION NUMBER: Z49094 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2047 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTATGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACCTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 

TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 250 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 300 

ATTGATACCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGCAA 350 

TTCCCTCCAA GGTGGATCAA CTCTCACCCA ACAGTTGATT AAATTGACTT 4 00 

ACTTTTCAAC CTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 4 50 

GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAAC AGGAAATCTT 500 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGCAAC TATGGAATGC 550 

AGACAGCAGC TCAAAACTAC TATGGTAAAG ACCTCAATAA TTTAAGTTTA 600 

CCTCAGTTAG CCTTGCTGGC TGGAATGCCT CAGGCACCAA ACCAATATGA 650 

CCCCTATTCA CATCCAGAAG CAGCCCAAGA CCGCCGAAAC TTGGTCTTAT 700 

CTGAAATGAA AAATCAAGGT TACATCTCTG CTGAACAGTA TGAGAAAGCA 750 

GTCAATACAC CAATTACTGA TGGACTACAA AGTCTCAAAT CAGCAAGTAA 800 

TTACCCTGCT TACATGGATA ATTACCTCAA GGAAGTCATC AATCAAGTTG 850 

AAGAAGAAAC TGGCTATAAC CTTCTAACTA CTGGGATGGA TGTTTACACA 900 
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AATGTAGACC AAGAAGCTCA AAAACATCTG 
TCAATACGTC TCTTACCCTG ACGATGATTT 
TAGATGTTTC AAATGGTAAA GTCATCGCAC 
GCAAGTAATG TTTCATTCGG TACCAACCAG 
CTGGGGATCA TCAATGAAAC CAATCACTGA 
ATGGAGTCTA TGACTCTACT GCTTCTATTG 
TATCCTGGCA CTGATACTCC ACTCTACAAC 
AAACATTACA ATCCAGTATG CTCTTCAACA 
TTGAGACTTT GAATAAGGTC GGTCTAGATA 
GGTCTTGGTA TCGACTATCC AAGCATGCAT 
TAACACAACT GAATCCAACA AAAAATATGG 
CTGCTGCCTA CGCTGCTTTT GCTAATGGTG 
TATATCAATA AAATCGTCTT TAGTGATGGT 
TGCTGGTACA CGAGCTATGA AAGAGACTAC 
TGATGAAAAC TGTTTTAACT TACGGAACAG 
TGGCTTCCAC AAGCAGGTAA GACAGGTACT 
AATTGAAAAG TATATCAAGA ACACTGGTTA 
TTGTAGGGTA TACCCGTAAA TATGCAATGG 
AATCGTCTAA CTCCAATCAT CGGAGATGGT 
CTATCGTTCA ATGATAACTT ACCTTTCTGA 
GGACAATGCC AGATGGCTTG TATAGAAATG 
GGTGCTCGTT CTACGTGGAG CTCACCTGCT 
TGAAAGTTCA AGCTCATCAT CAGATAGTTC 
CCACTCCAAG CACAAATAAT AGTACGACTA 
CAACAATCAA ATACAACCCC TGATCAACAA 
ACAACCATAA 



PCT/CA00/01150 

TGGGATATCT ACAACTCCGA 950 

GCAAGTCGCA TCTACGGTCG 1000 

AACTTGGTGC TCGTCATCAA 1050 

GCCGTAGAAA CCAATCGTGA 1100 

CTATGCTCCC GCTTTAGAAT 1150 

TACATGATGT CCCTTATAAC 1200 

TGGGATCATG TCTACTTTGG 1250 

ATCACGAAAT GTCACAGCCG 13 00 

GAGCTAAAAC CTTCCTTAAT 1350 

TATGCAAACG CCATTTCAAG 14 00 

TGCAAGTAGT GAAAAAATGG 14 50 

GTATTTATCA CAAACCAATG 1500 

AGCGAAAAAG AATTTTCTGA 1550 

TGCCTATATG ATGACTGAAA 1600 

GACGTGGAGC CTACCTACCA 1650 

TCTAACTATA CTGACGAAGA 1700 

CGTAGCTCCA GATGAAATGT 1750 

CTGTTTGGAC AGGATACTCA 1800 

TTCCTTGTTG CTGGTAAAGT 1850 

AGATGACCAA CCTGGAGATT 1900 

GAGAATTCGT ATTTAAAAAT 1950 

CCACAACAAC CCCCATCAAC 2000 

AACTTCACAG TCTAACTCAA 2050 

CCAATCCTAA CAATAATACG 2100 

AATCAGAATC CTCAACCAGC 2150 

2160 



2) INFORMATION FOR SEQ ID NO: 2048 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: R6 

(C) ACCESSION NUMBER: M90527 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2048 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTACGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACTTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 

TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 2 50 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 300 

ATTGATACCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGCAA 350 

TTCCCTCCAA GGTGGATCAG CTCTCACTCA ACAGTTGATT AAGTTGACTT 4 00 

ACTTTTCAAC TTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 4 50 

GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAGC AAGAAATCTT 500 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGGAAC TATGGAATGC 550 
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AGACAGCAGC TCAAAACTAC 
CCTCAGTTAG CCTTGCTGGC 
CCCCTATTCA CATCCAGAAG 
CTGAAATGAA AAATCAAGGC 
GTCAATACAC CAATTACTGA 
TTACCCTGCT TACATGGATA 
AAGAAGAAAC AGGCTATAAC 
AATGTAGACC AAGAAGCTCA 
CGAATACGTT GCCTATCCAG 
TTGATGTTTC TAACGGTAAA 
TCAAGTAATG TTTCCTTCGG 
CTGGGGATCA ACTATGAAAC 
ACGGTGTCTA CGAGTCAACT 
TACCCTGGGA CAAATACCCC 
CAACATCACC TTGCAATACG 
TGGAAACTCT AAACAAGGTC 
GGTCTCGGAA TCGACTACCC 
TAACACAACC GAATCAGACA 
CTGCTGCTTA CGCTGCCTTT 
TATATCCATA AAGTCGTCTT 
TGTCGGAACT CGTGCCATGA 
TGATGAAAAC AGTCTTGAGT 
TGGCTCCCTC AGGCTGGTAA 
AATTGAAAAC CACATCAAGA 
TTGCTGGCTA TACGCGTAAA 
AACCGTCTGA CACCACTTGT 
TTACCGCTCT ATGATGACCT 
GGAATATACC AGAGGGGCTC 
GGTGCTCGTT CTACGTGGAG 
TGAAAGTTCA AGCTCATCAT 
CCACTCCAAG CACAAATAAT 
CAACAATCAA ATACAACCCC 
ACAACCATAA 



TATGGTAAAG ACCTCAATAA 
TGGAATGCCT CAGGCACCAA 
CAGCCCAAGA CCGCCGAAAC 
TACATCTCTG CTGAACAGTA 
TGGGCTACAA AGTCTCAAAT 
ATTACCTCAA GGAAGTCATC 
CTACTCACAA CTGGGATGGA 
AAAACATCTG TGGGATATTT 
ACGATGAATT GCAAGTCGCT 
GTCATTGCCC AGCTAGGAGC 
AATTAACCAA GCAGTAGAAA 
CGATCACAGA CTATGCTCCT 
GCCACTATCG TTCACGATGA 
TGTTTATAAC TGGGATAGGG 
ccctgcaaca' ATCGCGAAAC 
GGACTCAACC GCGCCAAGAC 
AAGTATTCAC TACTCAAATG 
AAAAATATGG AGCAAGTAGT 
GCAAATGGTG GAACTTACTA 
TAGTGATGGG AGTGAAAAAG 
AAGAAACGAC AGCCTATATG 
TATGGAACTG GACGAAATGC 
AACAGGAACC TCTAACTATA 
CCTCTCAATT TGTAGCACCT 
TATTCAATGG CTGTATGGAC 
AGGCAATGGC CTTACGGTCG 
ACCTGTCTGA AGGAAGCAAT 
TACAGAAATG GAGAATTCGT 
CTCACCTGCT CCACAACAAC 
CAGATAGTTC AACTTCACAG 
AGTACGACTA CCAATCCTAA 
TGATCAACAA AATCAGAATC 



TTTAAGTTTA 600 

ACCAATATGA 650 

TTGGTCTTAT 700 

TGAGAAAGCA 750 

CAGCAAGTAA 800 

AATCAAGTTG 850 

TGTCTACACA 900 

ACAATACAGA 950 

TCTACCATTG 1000 

ACGCCATCAG 1050 

CAAACCGCGA 1100 

GCCTTGGAGT 1150 

GCCCTATAAC 1200 

GCTACTTTGG 1250 

GTCCCAGCCG 13 00 

TTTCCTAAAT 1350 

CCATTTCAAG 14 00 

GAAAAGATGG 1450 

TAAACCAATG 1500 

AGTTCTCTAA 1550 

ATGACCGACA 1600 

CTATCTTGCT 1650 

CAGACGAGGA 1700 

GATGAACTAT 1750 

AGGCTATTCT 1800 

CTGCCAAAGT 1850 

CCAGAGGATT 1900 

ATTTAAAAAT 1950 

CCCCATCAAC 2000 

TCTAGCTCAA 2050 

CAATAATACG 2100 

CTCAACCAGC 2150 
2160 



2) INFORMATION FOR SEQ ID NO: 2049 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: URU-E159 

(C) ACCESSION NUMBER: AF139890 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2049 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 5 0 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT- AGGCGGAGGA GTTTTTTTCT 100 

ACTACGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACTTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 
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TTCTGAACGC CGCGTCAATG 
TTAAGGCAAT CGTTTCTATC 
ATTGATTCCA TCCGTATCCT 
TTCCCTCCAA GGTGGATCAA 
ACTTTTCAAC CTCGACTTCC 
GCTTGGTTAG CGATTCAGTT 
GACCTACTAT ATAAATAAGG 
AGACAGCAGC TCAAAACTAC 
CCTCAGTTAG CCTTGCTGGC 
CCCCTATTCA CATCCAGAAG 
CTGAAATGAA AAATCAAGGT 
GTCAATACAC CAATTACTGA 
TTACCCTGCT TACATGGATA 
AACAAGAAAC TGGCTATAAC 
AATGTAGACC AAGAAGCTCA 
TCAATACGTC TCTTACCCTG 
TAGATGTTTC AAATGGTAAA 
GCAAGTAACG TTTCATTTGG 
CTGGGGTTCT GCTATGAAAC 
ACGGTGTTTA TGATTCCACT 
TATCCGGGAA CAAGCACACC 
TAATATTACT CTGCAATATG 
TTGAGACTTT GAATAAGGTC 
GGTCTTGGTA TCGACTATCC 
TAATACAACA GAATCTAATA 
CTGCTGCTTA TGCTGCCTTT 
TATATCCATA AAGTCGTCTT 
TGTCGGAACT CGTGCCATGA 
TGATGAAAAC AGTCTTGACT 
TGGCTTCCTC AAGCTGGTAA 
AGTTGAAAAC CACATCAAGA 
TTGTTGGTTA TACTCGTAAG 
AATCGTTTAA CTCCTATCGT 
TTATCGCTCA ATGATAACGT 
GGACGATGCC AGACGGACTT 
GGTGCTCGTT CTACGTGGAA 
TGAAAGTTCA AGCTCATCAT 
CCACTCCAAG CACAAATAAT 
CAACAATCAA ATACAACCCC 
ACAACCATAA 



CCCAAGCTAA TGATATTCCC 
GAAGACCATC GCTTCTTCGA 
GGGAGCTTTC TTGCGCAATC 
CTCTCACCCA ACAGTTGATT 
GACCAGACTA TTTCTCGTAA 
AGAACAAAAA GCAACCAAAC 
TCTACATGTC TAATGGCAAC 
TATGGTAAAG ACCTCAATAA 
TGGAATGCCT CAGGCACCAA 
CAGCCCAAGA CCGCCGAAAC 
TACATCTCTG CTGAACAGTA 
TGGACTACAA AGTCTCAAAT 
ATTACCTCAA GGAGGTCATC 
CTTCTAACTA CTGGGATGGA 
AAAACATCTG TGGGATATCT 
ACGATGATTT GCAAGTCGCA 
GTCATCGCCC AACTTGGAGC 
TACCAACCAA GCTGTGGAAA 
CAATCACCGA TTATGCACCT 
GCAACTATGG TTAATGATAT 
TGTCTACAAC TGGGATAGAG 
CTCTTCAACA ATCACGAAAT 
GGTCTAGATA GAGCTAAAAC 
AAGCATGCAT TATGCAAACG 
AACAATACGG AGCAAGTAGT 
GCAAATGGTG GCACTTACTA 
CAGTGATGGA AGTAAAAAAG 
AGGAAACGAC AGCCTATATG 
TATGGAACTG GGCGTGGAGC 
AACAGGAACC TCTAACTATA 
ACACTGGCTA TGTAGCTCCA 
TATTCTATGG CTGTATGGAC 
TGGAGATGGT TTCCTAGTTG 
ATCTATCAGA AGATACTCAT 
TTCAGAAATG GAGAATTCGT 
CTCACCTGCT CCACAACAAC 
CAGATAGTTC AACTTCACAG 
AGTACGACTA CCAATCCTAA 
TGATCAACAA AATCAGAATC 



ACAGATTTGG 


250 


CCACAGGGGG 


300 


TGCAAAGTAA 


350 


AAGTTGACTT 


400 


GGCTCAGGAA 


450 


AGGAAATCTT 


500 


TATGGAATGC 


550 


TTTAAGTTTA 


600 


ACCAATATGA 


650 


TTGGTCTTAT 


700 


TGAGAAAGCA 


750 


CAGCAAGTAA 


800 


AATCAAGTAG 


850 


TGTTTACACA 


900 


ACAACTCCGA 


950 


TCTACGGTCG 


1000 


TCGTCACCAA 


1050 


CCAATCGTGA 


1100 


GCCATAGAAT 


1150 


TCCTTATAAC 


1200 


CATATTTCGG 


1250 


GTCACAGCCG 


1300 


CTTCCTTAAT 


1350 


CCATTTCAAG 


1400 


GAAAAAATGG 


1450 


TAAACCAATG 


1500 


AGTTCTCTAA 


1550 


ATGACCGACA 


1600 


CTATCTTCCT 


1650 


CAGATGAGGA 


1700 


GATGAAATGT 


1750 


AGGTTATTCG 


1800 


CAGCTAAAGT 


1850 


CCAGAAGACT 


1900 


ATTTAAAAAT 


1950 


CCCCATCAAC 


2000 


TCTAGCTCAA 


2050 


CAATAATACG 


2100 


CTCAACCAGC 


2150 




2160 



2) INFORMATION FOR SEQ ID NO: 2050 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 8303; 35193 

(C) ACCESSION NUMBER: AF046230 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2050 



CTGTGGGATA 
TTTGCAAGTC 
CCCAACTTGG 
CAAGCTGTGG 
CGATTATGCA 
TGGTTAATGA 
AACTGGGATA 
ACAATCACGA 
ATAGAGCTAA 
CATTATGCAA 
CGGAGCAAGT 
GTGGCACTTA 
GGAAGTAAAA 
GACAGCCTAT 
CTGGGCGTGG 
ACCTCTAACT 
CTATGTAGCT 
TGGCTGTATG 
GGTTTCCTAG 



TCTACAACTC 
GCATCTACGG 
AGCTCGTCAC 
AAACCAATCG 
CCTGCCATAG 
TATTCCTTAT 
GAGCATATTT 
AATGTCACAG 
AACCTTCCTT 
ACGCCATTTC 
AGTGAAAAAA 
CTATAAACCA 
AAGAGTTCTC 
ATGATGACCG 
AGCCTATCTT 
ATACAGATGA 
CCAGATGAAA 
GACAGGTTAT 
TTGCAGCTAA 



CGATCAATAC GTCTCTTACC CTGACGATGA 
TCGTAGATGT TTCAAATGGT AAAGTCATCG 
CAAGCAAGTA ACGTTTCATT TGGTACCAAC 
TGACTGGGGT TCTGCTATGA AACCAATCAC 
AATACGGTGT TTATGATTCC ACTGCAACTA 
AACTATCCGG GAACAAGCAC ACCTGTCTAC 
CGGTAATATT ACTCTGCAAT ATGCTCTTCA 
CCGTTGAGAC TTTGAATAAG GTCGGTCTAG 
AATGGTCTTG GTATCGACTA TCCAAGCATG 
AAGTAATACA ACAGAATCTA ATAAACAATA 
TGGCTGCTGC TTATGCTGCC TTTGCAAATG 
ATGTATATCC ATAAAGTCGT CTTCAGTGAT 
TAATGTCGGA ACTCGTGCCA TGAAGGAAAC 
ACATGATGAA AACAGTCTTG ACTTATGGAA 
CCTTGGCTTC CTCAAGCTGG TAAAACAGGA 
GGAAGTTGAA AACCACATCA AGAACACTGG 
TGTTTGTTGG TTATACTCGT AAGTATTCTA 
TCGAATCGTT TAACTCCTAT CGTTGGAGAT 
AGTTTATCGC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
930 



2) INFORMATION FOR SEQ ID NO: 2051 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 63509; Mil 

(C) ACCESSION NUMBER: AF04 6238 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2051 

CTGTGGGATA TTTACAATAC AGACGAATAC GTTGCCTATC CAGATGATGA 50 

CATGCAAGTA GCTTCGACTA TTGTAGATGT ATCTAATGGT AACGTTATTG 100 

CACAACTTGG TGCTCGTCAT CAAGCAAGTA ATGTTTCATT CGGCACCAAC 150 

CAGGCTGTGG AGACCAATCG TGACTGGGGT TCTTCTATGA AACCAATCAC 200 

TGACTATGCT CCCGCTTTAG AATATGGAGT CTATGACTCT ACTGCTTCTA 250 

TTGTACATGA TGTTCCTTAT AACTATCCTG GCACTGATAC TCCAGTCTAC 300 

AACTGGGATC ATGTCTACTT TGGAAACATT ACAATCCAGT ATGCTCTTCA 350 

ACAATCACGA AATGTCACAG CCGTTGAGAC TTTGAATAAG GTCGGTCTAG 400 

ATAGAGCTAA AACCTTCCTT AATGGTCTTG GTATCGACTA TCCAAGCATG 450 

CATTATGCAA ACGCCATTTC AAGTAACACA ACTGAATCCA ACAAACAGTA 500 

CGGTGCAAGT AGTGAAAAGA TGGCTGCTGC TTATGCCGCC TTTTCTAATG 550 

GTGGTATTTA CCACAAACCA ATGTATATCA ATAAAATCGT CTTCAGTGAT 600 

GGTAGTGAAA AAGAATTTTC TGATGCTGGT ACACGAGCTA TGAAAGAAAC 650 

TACTGCCTAT ATGATGACCG AAATGATGAA AACTGTTTTA GCTTACGGAA 700 

CCGGACGTGG TGCTTACCTA CCATGGCTTC CACAAGCAGG TAAGACAGGT 750 
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ACTTCTAACT ATACTGACGA 
CTACGTAGCT CCAGATGAAA 
TGGCTGTATG GACTGGTTAC 
GGTTTCCTAG TTGCTGCTAA 



CGAAATTGAA AAGTATATCA 
TGTTTGTTGG TTATACTCGT 
TCAAATCGTT TAACTCCAAT 
GGTTTATCGC 



PCT/CA00/0115O 

AGAACACTGG 800 
AAGTATTCTA 850 
CGTAGGAGAT 900 
930 



2) INFORMATION FOR SEQ ID NO: 2052 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1195 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear' 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: #22/HA5 

(C) ACCESSION NUMBER: AB006877 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2052 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 50 

TTCCAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 100 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 150 

TCAACAATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 200 

ATATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 250 

GAACAAGCAC ACCTGTCTAC AACTGGGATC GAGCATATTT TGGTAATATT 300 

ACTCTGCAAT ATGCCCTTCA ACAATCACGT AATGTTACAG CAGTTGAAAC 350 

ATTAAACAAG GTTGGTTTAG ATCGAGCCAA AACTTTCCTA AATGGCTTAG 4 00 

GTATCGATTA CCCAAGCATA CACTATGCTA ACGCCATTTC AAGTAATACA 4 50 

ACAGAATCTA ACAAACATTA CGGTGCAAGT AGTGAAAAAA TGGCTGCTGC 500 

TTATGCCGCC TTTGCTAATG GTGGTATTTA CCACAAACCA ATGTATATCA 550 

ATAAAATCGT CTTTAGTGAT GGTAGTGAGA AAGAATTTTC TGATGCTGGT 6 00 

ACACGAGCTA TGAAAGAAAC TACTGCCTAT ATGATGACCG AAATGATGAA 650 

AACTGTTTTA GCTTACGGAA CCGGACGTGG TGCTTACCTA CCATGGCTTC 700 

CACAAGCAGG TAAGACAGGT ACTTCTAACT ATACTGATGA CGAAATCGAA 750 

AAACATATCA AGAATACTGG CTACGTAGCT CCAGATGAAA TGTTTGTAGG 800 

CTATACTCGT AAGTATTCTA TGGCTGTATG GACTGGTTAC TCAAATCGTT 850 

TAACTCCAAT TGTTGGAGAT GGTTTCCTAG TTGCTGCTAA GGTTTATCGC 900 

TCAATGATAT CGTATCTATC AGAAGATGAC CAACCTGGAG ATTGGACGAT 950 

GCCAGACGGC CTGTTCCGAA ATGGAGAATT CGTATTTAAA AATGGTGCTC 1000 

GTTCTACGTG GAACTCACCT GCTCCACAAC AACCCCCATC AACTGAAAGT 1050 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 1100 

AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1150 

CAAATACAAC CCCTGATCAA CAAAATCAGA ATCCTCAACC AGCAC 1195 



2) INFORMATION FOR SEQ ID NO: 2053 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



PCT/CAOO/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 17619 

(C) ACCESSION NUMBER: AF04 6237 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2053 

CTGTGGGATA TTTACAATAC AGACGAATAC GTTGCCTATC CAGACGATGA 50 

ATTGCAAGTC GCTTCTACCA TTGTTGATGT TTCTAACGGT AAAGTCATTG 100 

CCCAGCTAGG AGCACGCCAT CAGTCAAGTA ATGTTTCCTT CGGAATTAAC 150 

CAAGCAGTAG AAACAAACCG CGACTGGGGA TCAACTATGA AACCGATCAC 200 

AGACTATGCT CCTGCCTTGG AGTACGGTGT CTACGATTCA ACTGCTACTA 250 

TCGTTCACGA TGAGCCCTAT AACTACCCTG GGACAAATAC TCCTGTTTAT 300 

AACTGGGATA GGGGCTACTT TGGCAACATC ACCTTGCAAT ACGCCCTGCA 350 

ACAATCGCGA AACGTCCCAG CCGTGGAAAC TCTAAACAAG GTCGGACTCA 4 00 

ACCGCGCCAA GACTTTCCTA AATGGTCTCG GAATCGACTA CCCAAGTATT 4 50 

CACTACTCAA ATGCCATTTC AAGTAACACA ACCGAATCAG ACAAAAAATA 500 

TGGAGCAAGT AGTGAAAAGA TGGCTGCTGC TTACGCTGCC TTTGCAAATG 550 

GTGGAACTTA CTATAAACCA ATGTATATCC ATAAAGTCGT CTTTAGTGAT 600 

GGGAGTGAAA AAGAGTTCTC TAATGTCGGA ACTCGTGCCA TGAAGGAAAC 650 

GACAGCCTAT ATGATGACCG AAATGATGAA AACAGTCTTG AGTTATGGAA 7 00 

CTGGACGAAA TGCCTATCTT GCTTGGCTTC CTCAAGCTGG TAAAACAGGT 750 

ACCTCTAACT ATACAGATGA AGAAATTGAA AAACACATCA AAAACACTGG 800 

CTATGTAGCT CCAGATGAAA CGTTTGTTGG TTATACTCGT AAGTATTCTA 850 

TGGCTGTATG GACAGGTTAC ACAAACCGTC TCACACCAAT TGTGGGGGAT 900 

GGCTTCACAG TTGCTGCCAA AGTTTACCGC 930 



2) INFORMATION FOR SEQ ID NO: 2054 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 306 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: R6 

(C) ACCESSION NUMBER: Y07845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2054 

CGTCGCATTC TCTACGGAAT GAATGAATTG GGTGTGACCC CAGACAAACC 50 

CCATAAAAAA TCTGCTCGTA TTACAGGGGA TGTCATGGGT AAATACCACC 100 

CACACGGGGA TTCCTCTATT TATGAAGCCA TGGTCCGTAT GGCTCAATGG 150 

TGGAGCTACC GTTACATGCT TGTAGATGGT CATGGGAATT TTGGTTCCAT 200 

GGATGGAGAT AGTGCTGCCG CTCAACGTTA TACCGAGGCA CGTATGAGCA 250 

AGATTGCTCT GGAAATGCTT CGTGATATCA ACAAAAATAC AGTTGATTTC 300 

GTTGAT 306 
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2) INFORMATION FOR SEQ ID NO: 2055 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2472 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 7785 

(C) ACCESSION, NUMBER: Z67739 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2055 

ATGTCTAACA TTCAAAACAT GTCCCTGGAG GACATCATGG GAGAGCGCTT 50 
TGGTCGCTAC TCCAAGTACA TTATTCAAGA CCGGGCTTTG CCAGATATTC 100 
GTGATGGGTT GAAGCCGGTT CAGCGCCGTA TTCTTTATTC TATGAATAAG 150 
GATAGCAATA CTTTTGACAA GAGCTACCGT AAGTCGGCCA AGTCAGTCGG 2 00 

GAACATCATG GGGAATTTCC ACCCACACGG GGATTCTTCT ATCTATGATG 250 
CCATGGTTCG TATGTCACAG AACTGGAAAA ATCGTGAGAT TCTAGTTGAA 300 
ATGCACGGTA ATAACGGTTC TATGGACGGA GATCCTCCTG CGGCTATGCG 350 
TTATACTGAG GCACGTTTGT CTGAAATTGC AGGCTACCTT CTTCAGGATA 4 00 

TCGAGAAAAA GACAGTTCCT TTTGCATGGA ACTTTGACGA TACGGAGAAA 4 50 

GAACCAACGG TCTTGCCAGC AGCCTTTCCA AACCTCTTGG TCAATGGTTC 500 
GACTGGGATT TCGGCTGGTT ATGCCACAGA CATTCCTCCC CATAATTTAG 550 
CTGAGGTCAT AGATGCTGCA GTTTACATGA TTGACCACCC AACTGCAAAG 600 
ATTGATAAAC TCATGGAATT CTTACCTGGA CCAGACTTCC CTACAGGGGC 650 
TATTATTCAG GGTCGTGATG AAATCAAGAA AGCTTATGAG ACTGGGAAAG 700 
GGCGCGTGGT TGTTCGTTCC AAGACTGAAA TTGAAAAGCT AAAAGGTGGT 750 
AAGGAACAAA TCGTTATTAC TGAGATTCCT TATGAAATCA ATAAGGCCAA 800 
TCTAGTCAAG AAAATCGATG ATGTTCGTGT TAATAACAAG GTAGCTGGGA 8 50 

TTGCTGAGGT TCGTGATGAG TCTGACCGTG ATGGTCTTCG TATCGCTATC 900 
GAACTTAAGA AAGACGCTAA TACTGAGCTT GTTCTCAACT ACTTATTTAA 950 

GTACACCGAC CTACAAATCA ACTACAACTT TAATATGGTG GCGATTGACA 1000 

ATTTCACACC TCGTCAGGTT GGGATTGTTC CAATCCTGTC TAGCTACATC 10 50 

GCTCACCGTC GAGAAGTGAT TTTGGCGCGT TCACGCTTTG ACAAAGAAAA 1100 

GGCTGAGAAA CGTCTCCATA TCGTCGAAGG TTTGATTCGT GTGATTTCGA 1150 

TTTTGGATGA AGTCATTGCT CTTATCCGTG CTTCTGAGAA TAAGGCGGAC 12 00 

GCCAAGGAAA ACCTCAAAGT TAGCTATGAT TTTACGGAAG AACAGGCTGA 1250 

GGCTATCGTA ACTTTGCAAC TGTACCGTTT GACCAATACC GATGTGGTTG 1300 

TCTTGCAGGA AGAAGAAGCA GAGCTTCGTG AGAAGATTGC TATGCTGGCG 1350 

GCTATTATCG GTGATGAAAG GACTATGTAC AATCTCATGA AGAAAGAACT 1400 

TCGTGAGGTC AAGAAGAAAT TTGCAACTCC TCGTTTGAGT TCTTTAGAAG 1450 

ACACTGCGAA AGCAATTGAG ATTGATACAG CTAGTCTTAT CGCTGAGGAA 1500 

GATACCTACG TCAGCGTGAC CAAGGCAGGT TACATCAAGC GTACCAGTCC 1550 

ACGTTCCTTT GCGGCTTCCA CCTTGGAAGA AATTGGCAAG CGTGATGATG 1600 

ACCGTTTGAT TTTTGTTCAA TCTGCCAAGA CAACCCAGCA CCTCTTGATG 1650 

TTCACAAGTC TTGGAAATGT CATCTACAGA CCAATCCATG AGTTGGCAGA 1700 

TATTCGTTGG AAGGACATCG GAGAGCATCT GAGCCAAACC ATCACAAACT 1750 

TTGAAACGAA TGAAGAAATC CTTTATGTGG AAGTACTGGA TCAGTTTGAC 1800 

GATGCGACAA CCTACTTTGC AGTGACTCGC CTTGGTCAAA TCAAACGGGT 1850 
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CTATAGATCT AAGTCTGTCA 1900 

AGATTGTAGC AGTGGCTCCG 1950 

CAAAATGGTT ATGCCCTGCG 2000 

TGCTAAGGCA GCAGGTGTCA 2050 

TCCAATCTGG CTTTATCTGT 2100 

CGTGGAAGCT TGAAACGTGT 2150 

TGCCAAACGA GGATTACAAG 2200 

GTGTCTTCTT GGCAGGAGCA 2250 

TTCAGTACGG AAGTGGATGT 2300 

TAAAGGAACA ATCTATGAAA 2350 

GCACTAGCAA TGGAAGCTTC 2400 

TTTGACGCTT ATCTTCAGGA 2450 

2472 



2) INFORMATION FOR SEQ ID NO: 2056 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2056 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATTGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGCAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCAACA 200 

ATGAAACCAA TCACAGACTA TGCTCCTGCC TTGGAATACG GTGTCTACGA 250 

TTCAACTGCT ACTATCGTTC ACGATGAGCC CTATAACTAT CCTGGGACAG 300 

ATACCCCTGT CTATAACTGG GATAGGGGCT ACTTTGGCAA TATCACCTTG 350 

CAATACGCCC TGCAACAATC GCGAAACGTC CCAGCCGTGG AAACACTAAA 4 00 

CAAGGTCGGG CTCAACCGCG CCAAGACTTT TCTAAATGGT CTCGGAATCG 4 50 

ACTACCCAAG TATTCACTAC TCAAATGCCA TTTCAAGTAA CACAACCGAG 500 

TCAGACAAAA AATATGGAGC AAGTAGTGAA AAGATGGCTG CTGCTTACGC 550 

TGCCTTTGCA AATGGTGGAA CTTACTATAA ACCAATGTAT ATCCATAAAG 600 

TCGTCTTTAG TGATGGAAGT GAAAAAGAGT TCTCTAATGT CGGAACTCGC 650 

GCCATGAAAG AAACGACTGC TTACATGATG ACAGAAATGA TGAAAACAGT 700 

CTTGACGTAC GGAATTGGTC GTGGTGCCTA CCTGCCTTGG CTTCCTCAAG 750 

CTGGTAAAAC AGGTACTTCT AACTATACTG ACGAAGAAAT TGAAAAGTAT 800 

ATCAAGAACA CTGGTTACGT AACTCCAGAT GAAATGTTTG TAGGGTATAC 850 

CCGTAAATAT GCAATGGCTG TATGGACAGG CTATTCTAAC CGTCTGACAC 900 

CACTTGTAGG CGATGGCCTT ACGGTCGCTG CCAAGGTTTA CCGCTCTATG 950 

ATGACCTACC TGTCTGAAGG AAGCAATCCA GAGGATTGGA ATATACCAGA 1000 

GGGGCTCTAC AGAAATGGAG AATTCGTATT TAAAAATGGT GCTCGTTCTA 1050 

CGTGGAGCTC ACCTGCTCCA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

CAACCCCTGA TC 1212 
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2) INFORMATION FOR SEQ ID NO: 2057 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1242 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2057 



ATGTAGACCA AGAAGCTCAA AAACATCTGT GGGATATTTA CAATACAGAC 50 

GAATACGTTG CCTATCCAGA CGATGAATTG CAAGTCGCTT CTACCATTGT 100 

TGATGTTTCT AACGGGAAAG TTATTGCTCA GTTAGGTTCT CGTCACCAAT 150 

CAAGCAATGT TTCCTTCGGA ATCAACCAAG CTGTTGAAAC CAACCGTGAC 200 

TGGGGTTCTG CCATGAAGCC AATCACAGAC TATGCTCCTG CCTTAGAGTA 250 

TGACATCTAC GACTCAACTG CTTCGATTGT ACATGATGTT CCTTATAACT 300 

ATCCAGGTAC TGATACTCCC CTCTACAACT GGGATAAAGT CTACTTTGGA 350 

AATATTACAA TCCAGTATGC ACTTCAACAG TCACGTAATG TCACAGCCGT 400 

TGAGACTTTG AATAAGGTCG GTCTAGATAG AGCTAAAACC TTTCTTAATG 4 50 

GTCTTGGTAT CGACTATCCA AGCATGCATT ATGCAAACGC CATTTCAAGT 500 

AATACGACTG AGTCAAACAA AAAGTACGGA GCAAGTAGTG AGAAAATGGC 550 

TGCTGCTTAC GCTGCTTTTG CTAACGGTGG TATCTACCAT AAACCAATGT 6 00 

ATATCAACAA AATCGTCTTT AGCGATGGTA GCTCAAAAGA ATACGCTGAT 650 

CCTGGTACTC GTGCCATGAA AGAGACGACC GCCTATATGA TGACAGAAAT 700 

GATGAAGACT GTCTTGGCAT ACGGAACGGG TCGTGGTGCT TATCTCCCTT 750 

GGCTACCTCA AGCTGGTAAG ACTGGTACAT CAAACTATAC AGATGATGAA 800 

ATTGAAAACT ACATCAAAAA TACTGGTTAT GTAGCTCCAG ACGAAATGTT 850 

TGTTGGTTAT ACTCGCAAAT ATTCAATGGC GGTATGGACA GGTTACTCAA 900 

ACCGCCTGAC TCCTATCGTT GGTGATGGCT TCTATGTTGC AGCTAAGGTT 950 

TACCGTTCAA TGATGACTTA TCTGTCTGAG GATAACAACC CTGGCGACTG 1000 

GACTATGCCA GAAGGTCTCT ATCGAAGTGG TGAGTTCGTC TTTAAAAAAG 1050 

GTGCTCGTTC TGCATGGACT GCTCCTGCTC CGCAACAGGC CCCAACACCA 1100 

GAAAGTTCGA GCTCGACATC AGAAAGTTCA ACTTCACAGT CAAGCTCAAC 1150 

TACTCCAAGC ACGAATAATA GTGCAAACAA TAATACCAAT AACCAGCAAC 1200 

CAAATACAAC GCCTGGTCAA CAAAACCAGA ACCAAAATCA GA 1242 



2) INFORMATION FOR SEQ ID NO: 2058 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1225 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2058 

GAAGCTCAAA AACATCTGTG GGATATTTAC AATACAGACG AATACGTTGC 50 

CTATCCAGAC GATGAATTGC AAGTCGCTTC TACCATTGTT GATGTTTCTA 100 

ACGGGAAAGT TATTGCTCAG TTAGGTTCTC GTCACCAATC AAGCAATGTT 150 

TCCTTCGGAA TCAACCAAGC TGTTGAAACC AACCGTGACT GGGGTTCTGC 200 

CATGAAGCCA ATCACAGACT ATGCTCCTGC CTTAGAGTAT GACATCTACG 250 

ACTCAACTGC TTCGATTGTA CATGATGTTC CTTATAACTA TCCAGGTACT 300 

GATACTCCCC TCTACAACTG GGATAAAGTC TACTTTGGAA ATATTACAAT 350 

CCAGTATGCA CTTCAACAGT CACGTAATGT CACAGCCGTT GAGACTTTGA 4 00 

ATAAGGTCGG TCTAGATAGA GCTAAAACCT TTCTTAATGG TCTTGGTATC 450 

GACTATCCAA GCATGCATTA TGCAAACGCC ATTTCAAGTA ATACGACTGA 500 

GTCAAACAAA AAGTACGGAG CAAGTAGTGA GAAAATGGCT GCTGCTTACG 550 

CTGCTTTTGC TAACGGTGGT ATCTACCATA AACCAATGTA TATCAACAAA 600 

ATCGTCTTTA GCGATGGTAG CTCAAAAGAA TACGCTGATC CTGGTACTCG 650 

TGCCATGAAA GAGACGACCG CCTATATGAT GACAGAAATG ATGAAGACTG 700 

TCTTGGCATA CGGAACGGGT CGTGGTGCTT ATCTCCCTTG GCTACCTCAA 750 

GCTGGTAAGA CTGGTACATC AAACTATACA GATGATGAAA TTGAAAACTA 800 

CATCAAAAAT ACTGGTTATG TAGCTCCAGA CGAAATGTTT GTTGGTTATA 850 

CTCGCAAATA TTCAATGGCG GTATGGACAG GTTACTCAAA CCGCCTGACT 900 

CCTATCGTTG GTGATGGCTT CTATGTTGCA GCTAAGGTTT ACCGTTCAAT 950 

GATGACTTAT CTGTCTGAGG ATAACAACCC TGGCGACTGG ACTATGCCAG 1000 

AAGGTCTCTA TCGAAGTGGT GAGTTCGTCT TTAAAAAAGG TGCTCGTTCT 1050 

GCATGGACTG CTCCTGCTCC GCAACAGGCC CCAACACCAG AAAGTTCGAG 1100 

CTCGACATCA GAAAGTTCAA CTTCACAGTC AAGCTCAACT ACTCCAAGCA 1150 

CGAATAATAG TGCAAACAAT AATACCAATA ACCAGCAACC AAATACAACG 1200 

CCTGGTCAAC AAAACCAGAA CCAAA 1225 



2) INFORMATION FOR SEQ ID NO: 2059 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-38 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2059 

TGGCTACTTT GGAAACATTA CAGTCCAGTA TGCTCTTCAA CAATCACGAA 50 

ATGTCACAGC CGTTGAGACT TTGAATAAGG TCGGTCTAGA TAGAGCTAAA 100 

ACCTTCCTCA ATGGACTTGG TATCGATTAT CCAAGCATGC TTTATGCAAA 150 

CGCCATTTCA AGTAACACAA CTGAATCCAA CAAAAAGTAC GGAGCAAGTA 200 

GTGAAAAAAT GGCCGCTGCC TACGCAGCTT TTGCTAATGG TGGTACTTAC 250 

CACAAACCAA TGTATATCAA TAAAATCGTC TTTAGTGATG GTAGTGAAAA 300 

AGAATTTTCT GATGCCGGTA CTCGGGCTAT GAAAGAAACT ACTGCCTATA 350 
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TGATGACCGA AATGATGAAA 
GCCTACCTAC CTTGGCTTCC 
TACTGACGAA GAAATTGAAA 
CAGATGAAAT GTTTGTTGGT 
ACAG 



ACTGTCTTAT TATACGGAAC 
ACAAGCAGGT AAGACAGGTA 
AGTATATCAA GAATGCTGGT 
TATACCCGCA AATATGCAAT 



CGGACGTGGA 4 00 

CTTCTAACTA 450 
TACGTAGCTC . 500 
GGCTGTTTGG 550 
554 



2) INFORMATION FOR SEQ ID NO: 2060 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1249 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : , Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-57 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2060 

CAACTGGGAT GGATGTCTAC ACAAATGTAG ACCAAGAAGC TCAAAAACAT 50 

CTGTGGGATA TTTACAATAC AGACGAATAC GTTGCCTATC CAGACGATGA 100 

ATTGCAAGTC GCTTCTACCA TTGTTGATGT TTCTAACGGT AAAGTCATTG 150 

CCCAGCTAGG AGCACGCCAT CAGTCAAGTA ATGTTTCCTT CGGAATTAAC 2 00 

CAAGCAGTAG AAACAAACCG CGACTGGGGA TCAACTATGA AACCGATCAC 250 

AGACTATGCT CCTGCCTTGG AGTACGGTGT CTACGATTCA ACTGCTACTA 3 00 

TCGTTCACGA TGAGCCCTAT AACTACCCTG GGACAAATAC TCCTGTTTAT 3 50 

AACTGGGATA GGGGCTACTT TGGCAACATC ACCTTGCAAT ACGCCCTGCA 4 00 

ACAATCGCGA AACGTCCCAG CCGTGGAAAC TCTAAACAAG GTCGGACTCA 4 50 

ACCGCGCCAA GACTTTCCTA AATGGTCTAG GAATCGACTA CCCAAGTATT 500 

CACTACTCAA ATGCCATTTC AAGTAACACA ACCGAATCAG ACAAAAAATA 550 

TGGAGCAAGT AGTGAAAAGA TGGCTGCTGC TTACGCTGCC TTTGCAAATG 600 

GTGGAACTTA CTATAAACCA ATGTATATCC ATAAAGTCGT CTTTAGTGAT 650 

GGGAGTGAAA AAGAGTTCTC TAATGTCGGA ACTCGTGCCA TGAAGGAAAC 700 

GACAGCCTAT ATGATGACCG ACATGATGAA AACAGTCTTG ACTTATGGAA 750 

CTGGACGAAA TGCCTATCTT GCTTGGCTCC CTCAGGCTGG TAAAACAGGA 800 

ACCTCTAACT ATACAGACGA GGAAATTGAA AACCACATCA AGACCTCTCA 850 

ATTTGTAGCA CCTGATGAAC TATTTGCTGG CTATACGCGT AAATATTCAA 900 

TGGCTGTATG GACAGGCTAT TCTAACCGTC TGACACCACT TGTAGGCAAT 950 

GGCCTTACGG TCGCTGCCAA AGTTTACCGC TCTATGATGA CCTACCTGTC 1000 

TGAAGGAAGC AATCCAGAAG ATTGGAATAT ACCAGAGGGG CTCTACAGAA 1050 

ATGGAGAATT CGTATTTAAA AATGGTGCTC GTTCTACGTG GAGCTCACCT 1100 

GCTCCACAAC AACCCCCATC AACTGAAAGT TCAAGCTCAT CATCAGATAG 1150 

TTCAACTTCA CAGTCTAGCT CAACCACTCC AAGCACAAAT AATAGTACGA 1200 

CTACCAATCC TAACAATAAT ACGCAACAAT CAAATACAAC CCCTGATCA 124 9 



2) INFORMATION FOR SEQ ID NO: 2061 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2061 

GAAGTTGAAC AGACTGGTCA CCAAGCTCCT TCATATCCGC CAACACTGCA 50 

ATTTTCTTGC CACCTTCATT GGCTGGAATG GCAGAGAAAG TCTCTAAAAT 100 

CAGTTTCATA GCAGTTGGAT TGGCATTATA AACATCTGAC AGGATATCTG 150 

CTCCATTGGC TGCTTTCTTC CACTCGGTAC GGTTACGCGT CAATTCAAGA 200 

TGTTGGAAGG CCAAACGAAT TTGCTCCTCT GAAACTCCTT CTTGCAAGGC 250 

AACATAGGAT GCAATCATAG CATTTGTCGC ATTGTACTTG CCAGTTACTG 300 

GCAAATCAAG GGCTTGCTCT AAGAAATTGA CCTTGAAGGT CAGACTATCT 350 

TTGCGCTCAA CCAAGTCGGT AATTCCCAGC TCTGCTCCTT GACCAAAACG 4 00 

AACCACCTTT TTATCAGTTG GCAAATAGTC CTCTACGATA GGGTCAGCCG 4 50 

GCGCTAAAAG CAAGGAACCT GAAGCCATTC CGTCTGCAAT TTGCATTTTT 5 00 

CCTTTAGCAA TCTCAGAACG GTCTTTGAAA AAGGCCAAAT GAGCTTCTCC 550 

AACCAAGGTC ACGATGGCTG TATGGACAG 579 



2) INFORMATION FOR SEQ ID NO: 2062 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1216 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-63 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2062 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCACAAC TTGGTGCTCG TCATCAAGCA AGTAATGTTT 150 

CATTCGGTAC CAACCAGGCC GTAGAAACCA ATCGTGACTG GGGATCATCA 200 

ATGAAACCAA TCACTGACTA TGCTCCCGCT TTAGAATATG GAGTCTATGA 250 

CTCTACTGCT TCTATTGTAC ATGATGTCCC TTATAACTAT CCTGGCACTG 300 

ATACTCCACT CTACAACTGG GATCATGTCT ACTTTGGAAA CATTACAATC 350 

CAGTATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 4 00 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA CACAACTGAA 500 

TCCAACAAAA AATATGGTGC AAGTAGTGAA AAAATGGCTG CTGCCTACGC 550 

TGCTTTTGCT AATGGTGGTA TTTATCACAA ACCAATGTAT ATCAATAAAA 600 

TCGTCTTTAG TGATGGTAGC GAAAAAGAAT TTTCTGATGC TGGTACACGA 650 

GCTATGAAAG AGACTACTGC CTATATGATG ACTGAAATGA TGAAAACTGT 700 

TTTAACTTAC GGAACAGGAC GTGGAGCCTA CCTACCATGG CTTCCACAAG 750 
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ACGAAGAAAT TGAAAAGTAT 800 

GAAATGTTTG TAGGGTATAC 850 

ATACTCAAAT CGTCTAACTC 900 

GTAAAGTCTA TCGTTCAATG 950 

GGAGATTGGA CAATGCCAGA 1000 

TAAAAATGGT GCTCGTTCTA 1050 

CATCAACTGA AAGTTCAAGC 1100 

AGCTCAACCA CTCCAAGCAC 1150 

TAATACGCAA CAATCAAATA 1200 

1216 



2) INFORMATION FOR SEQ ID NO: 2063 . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: ATCC 700673 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2063 

AATACGTCTC TTACCCTGAC GATGATTTGC AAGTCGCATC CACGGTCGTA 50 

GATGTTTCAA ATGGTAAAGT CATCGCCCAA CTTGGAGCTC GTCACCAAGC 100 

AAGTAACGTT TCATTTGGTA CCAACCAAGC TGTGGAAACC AATCGTGACT 150 

GGGGTTCAAC AATGAAACCC ATCACCGATT ATGCACCTGC CATAGAATAC 2 00 

GGTGTATATG ATTCCACTGC AACTATGGTT AATGATATTC CTTATAACTA 2 50 

TCCGGGAACA AGCACACCTG TCTACAACTG GGATCGAGCA TATTTTGGTA 3 00 

ATATTACTCT GCAATATGCC CTTCAACAAT CTCGTAACGT ACCCGCCGTT 350 

GAGACACTAA ACAAGGTTGG TTTAGATAGA GCTAAAAGTT TCCTAAATGG 4 00 

TTTAGGAATC GACTATCCTG TAATGCACTA TTCAAATGCT ATTTCAAGTA 4 50 

ATACTACCGA ATCTAGTAAA CAGTACGGGG CAAGTAGTGA AAAAATGGCC 500 

ACTGCCTATG CCGCATTCGC AAACGGCGGT ATTTACCACA AACCAATGTA 550 

CATCAATAAG GTTGTCTTTA GCGATGGTAG CGAAAAAGAA TTTTCTGACC 600 

CTGGCACAAG AGCCATGAAA GAAACGACTG CTTACATGAT GACAGAGATG 650 

ATGAAAACAG TCTGGACTTA CGGAACTGGT CGCGGTGCCT ACCTACCTTG 700 

GCTTCCACAA GCAGGTAAAA CAGGTACTTC TAACTATACT GACGAAGAAA 750 

TTGAAAAGTA TATCAAGAAC ACTGGTTACG TAGCTCCAGA TGAAATGTTT 8 00 

GTAGGGTATA 810 



2) INFORMATION FOR SEQ ID NO: 2064 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 782 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: ATCC 700678 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2064 

TCTTACCCTG ACGATGATTT GCAAGTCGCA TCTACGGTCG TAGATGTTTC 50 

AAATGGTAAA GTCATCGCCC AACTTGGAGC TCGTCACCAA GCAAGTAACG 100 

TTTCATTTGG TACCAACCAA GCTGTGGAAA CCAATCGTGA CTGGGGATCA 150 

ACTATGAAAC CAATCACAGA CTATGCTCCT GCCTTGGAGT ACGGTGTCTA 200 

CGATTCAACT GCTACTATCG TTCACGATGA GCCCTATAAC TACCCTGGGA 250 

CAGATATCCC TCTCTATAAC TGGGATCGAG CATATTTCGG TAATATTACT 300 

CTGCAATATG CCCTTCAACA ATCTCGTAAC GTACCTGCCG TTGAAACACT 350 

AAACAAGGTC GGTCTAGATA AGGCTAAAAC CTTCCTTAAT GGTCTTGGTA 4 00 

TCGACTATCC AAGCATGCAT TATGCAAACG CCATTTCAAG TAATACAACT 450 

GAATCCAACA AAAAATATGG TGCAAGTAGT GAAAAAATAG CTACCGCCTA 500 

TGCCGCATTC GCAAATGGTG GTATTTACCA CAAACCAATG TACATCAATA 550 

AAGTTGTCTT TAGCGATGGT AGCGAAAAAG AATTTTCTGA CCCTGGCACA 600 

AGAGCCATGA AAGAAACGAC TGCTTACATG ATGACAGAAA TGATGAAAAC 650 

AGTCTGGACG TACGGAACTG GTCGTGGTGC CTACCTGCCT TGGCTTCCTC 700 

AAGCTGGTAA AACAGGTACC TCTAACTATA CTGACGAAGA AATTGAAAAG 750 

TATATCAAGA ACACTGGTTA CGTAGCTCCA GA 782 



2) INFORMATION FOR SEQ ID NO: 2065 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2065 
CCAGGACGTG GAGGCGATCA CA 22 



2) INFORMATION FOR SEQ ID NO: 2066 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2066 
CACCGACAGC GAGCCGATCA GA 22 
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2) INFORMATION FOR SEQ ID NO: 2067 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 2067 
AGCTGAGCCA ATTCATGG 18 



2) INFORMATION FOR SEQ ID NO: 2068 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1068 
ATTCATGGAC CAGAACAAC 19 



2) INFORMATION FOR SEQ ID NO: 2069 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2069 
CGCTGTCGGG GTTGACCC 



2) INFORMATION FOR SEQ ID NO: 2070 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2070 
GTTGACCCAC AAGCGCCG 18 



2) INFORMATION FOR SEQ ID NO: 2071 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY:, Linear 

(ii) MOLECULE TYPE: DNA 
(xi) 

CGACTGTCGG CGCTGGGG 



2) INFORMATION FOR SEQ ID NO: 2072 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3534 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



18 



SEQUENCE DESCRIPTION: SEQ ID NO: 2071 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: Rv 

(C) ACCESSION NUMBER: L27989 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2072 

GTGCTGGAAG GATGCATCTT GGCAGATTCC CGCCAGAGCA AAACAGCCGC 50 

TAGTCCTAGT CCGAGTCGCC CGCAAAGTTC CTCGAATAAC TCCGTACCCG 100 

GAGCGCCAAA CCGGGTCTCC TTCGCTAAGC TGCGCGAACC ACTTGAGGTT 150 

CCGGGACTCC TTGACGTCCA GACCGATTCG TTCGAGTGGC TGATCGGTTC 200 

GCCGCGCTGG CGCGAATCCG CCGCCGAGCG GGGTGATGTC AACCCAGTGG 250 

GTGGCCTGGA AGAGGTGCTC TACGAGCTGT CTCCGATCGA GGACTTCTCC 3 00 

GGGTCGATGT CGTTGTCGTT CTCTGACCCT CGTTTCGACG ATGTCAAGGC 350 

ACCCGTCGAC GAGTGCAAAG ACAAGGACAT GACGTACGCG GCTCCACTGT 4 00 

TCGTCACCGC CGAGTTCATC AACAACAACA CCGGTGAGAT CAAGAGTCAG 4 50 

ACGGTGTTCA TGGGTGACTT CCCGATGATG ACCGAGAAGG GCACGTTCAT 500 

CATCAACGGG ACCGAGCGTG TGGTGGTCAG CCAGCTGGTG CGGTCGCCCG 550 

GGGTGTACTT CGACGAGACC ATTGACAAGT CCACCGACAA GACGCTGCAC 600 

AGCGTCAAGG TGATCCCGAG CCGCGGCGCG TGGCTCGAGT TTGACGTCGA 650 

CAAGCGCGAC ACCGTCGGCG TGCGCATCGA CCGCAAACGC CGGCAACCGG 700 
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TCACCGTGCT GCTCAAGGCG CTGGGCTGGA CCAGCGAGCA GATTGTCGAG 750 

CGGTTCGGGT TCTCCGAGAT CATGCGATCG ACGCTGGAGA AGGACAACAC 800 

CGTCGGCACC GACGAGGCGC TGTTGGACAT CTACCGCAAG CTGCGTCCGG 850 

GCGAGCCCCC GACCAAAGAG TCAGCGCAGA CGCTGTTGGA AAACTTGTTC 900 

TTCAAGGAGA AGCGCTACGA CCTGGCCCGC GTCGGTCGCT ATAAGGTCAA 950 

CAAGAAGCTC GGGCTGCATG TCGGCGAGCC CATCACGTCG TCGACGCTGA 1000 

CCGAAGAAGA CGTCGTGGCC ACCATCGAAT ATCTGGTCCG CTTGCACGAG 1050 

GGTCAGACCA CGATGACCGT TCCGGGCGGC GTCGAGGTGC CGGTGGAAAC 1100 

CGACGACATC GACCACTTCG GCAACCGCCG CCTGCGTACG GTCGGCGAGC 1150 

TGATCCAAAA CCAGATCCGG GTCGGCATGT CGCGGATGGA GCGGGTGGTC 1200 

CGGGAGCGGA TGACCACCCA GGACGTGGAG GCGATCACAC CGCAGACGTT 1250 

GATCAACATC CGGCCGGTGG TCGCCGCGAT CAAGGAGTTC TTCGGCACCA 1300 

GCCAGCTGAG CCAATTCATG GACCAGAACA ACCCGCTGTC GGGGTTGACC 1350 

CACAAGCGCC GACTGTCGGC GCTGGGGCCC ,GGCGGTCTGT CACGTGAGCG 14 00 

TGCCGGGCTG GAGGTCCGCG ACGTGCACCC GTCGCACTAC GGCCGGATGT 1450 

GCCCGATCGA AACCCCTGAG GGGCCCAACA TCGGTCTGAT CGGCTCGCTG 1500 

TCGGTGTACG CGCGGGTCAA CCCGTTCGGG TTCATCGAAA CGCCGTACCG 1550 

CAAGGTGGTC GACGGCGTGG TTAGCGACGA GATCGTGTAC CTGACCGCCG 1600 

ACGAGGAGGA CCGCCACGTG GTGGCACAGG CCAATTCGCC GATCGATGCG 1650 

GACGGTCGCT TCGTCGAGCC GCGCGTGCTG GTCCGCCGCA AGGCGGGCGA 1700 

GGTGGAGTAC GTGCCCTCGT CTGAGGTGGA CTACATGGAC GTCTCGCCCC 1750 

GCCAGATGGT GTCGGTGGCC ACCGCGATGA TTCCCTTCCT GGAGCACGAC 1800 

GACGCCAACC GTGCCCTCAT GGGGGCAAAC ATGCAGCGCC AGGCGGTGCC 1850 

GCTGGTCCGT AGCGAGGCCC CGCTGGTGGG CACCGGGATG GAGCTGCGCG 1900 

CGGCGATCGA CGCGGCGACG TCGTCGTCGC AAGAAAGCGG CGTCATCGAG 1950 

GAGGTGTCGG CCGACTACAT CACTGTGATG CACGACAACG GCACCCGGCG 2 000 

TACCTACCGG ATGCGCAAGT TTGCCCGGTC CAACCACGGC ACTTGCGCCA 2050 

ACCAGTGCCC CATCGTGGAC GCGGGCGACC GAGTCGAGGC CGGTCAGGTG 2100 

ATCGCCGACG GTCCCTGTAC TGACGACGGC GAGATGGCGC TGGGCAAGAA 2150 

CCTGCTGGTG GCCATCATGC CGTGGGAGGG CCACAACTAC GAGGACGCGA 22 00 

TCATCCTGTC CAACCGCCTG GTCGAAGAGG ACGTGCTCAC CTCGATCCAC 2250 

ATCGAGGAGC ATGAGATCGA TGCTCGCGAC ACCAAGCTGG GTGCGGAGGA 23 00 

GATCACCCGC GACATCCCGA ACATCTCCGA CGAGGTGCTC GCCGACCTGG 2350 

ATGAGCGGGG CATCGTGCGC ATCGGTGCCG AGGTTCGCGA CGGGGACATC 2400 

CTGGTCGGCA AGGTCACCCC GAAGGGTGAG ACCGAGCTGA CGCCGGAGGA 24 50 

GCGGCTGCTG CGTGCCATCT TCGGTGAGAA GGCCCGCGAG GTGCGCGACA 2500 

CTTCGCTGAA GGTGCCGCAC GGCGAATCCG GCAAGGTGAT CGGCATTCGG 2550 

GTGTTTTCCC GCGAGGACGA GGACGAGTTG CCGGCCGGTG TCAACGAGCT 2 600 

GGTGCGTGTG TATGTGGCTC AGAAACGCAA GATCTCCGAC GGTGACAAGC 2650 

TGGCCGGCCG GCACGGCAAC AAGGGCGTGA TCGGCAAGAT CCTGCCGGTT 2700 

GAGGACATGC CGTTCCTTGC CGACGGCACC CCGGTGGACA TTATTTTGAA 2750 

CACCCACGGC GTGCCGCGAC GGATGAACAT CGGCCAGATT TTGGAGACCC 2800 

ACCTGGGTTG GTGTGCCCAC AGCGGCTGGA AGGTCGACGC CGCCAAGGGG 2850 

GTTCCGGACT GGGCCGCCAG GCTGCCCGAC GAACTGCTCG AGGCGCATGC 2 900 

GAACGCCATT GTGTCGACGC CGGTGTTCGA CGGCGCCCAG GAGGCCGAGC 2 950 

TGCAGGGCCT GTTGTCGTGC ACGCTGCCCA ACCGCGACGG TGACGTGCTG 3000 

GTCGACGCCG ACGGCAAGGC CATGCTCTTC GACGGGCGCA GCGGCGAGCC 3050 

GTTCCCGTAC CCGGTCACGG TTGGCTACAT GTACATCATG AAGCTGCACC 3100 

ACCTGGTGGA CGACAAGATC CACGCCCGCT CCACCGGGCC GTACTCGATG 3150 

ATCACCCAGC AGCCGCTGGG CGGTAAGGCG CAGTTCGGTG GCCAGCGGTT 32 00 

CGGGGAGATG GAGTGCTGGG CCATGCAGGC CTACGGTGCT GCCTACACCC 3250 

TGCAGGAGCT GTTGACCATC AAGTCCGATG ACACCGTCGG CCGCGTCAAG 3300 

GTGTACGAGG CGATCGTCAA GGGTGAGAAC ATCCCGGAGC CGGGCATCCC 3350 

CGAGTCGTTC AAGGTGCTGC TCAAAGAACT GCAGTCGCTG TGCCTCAACG 34 00 

TCGAGGTGCT ATCGAGTGAC GGTGCGGCGA TCGAACTGCG CGAAGGTGAG 3450 

GACGAGGACC TGGAGCGGGC CGCGGCCAAC CTGGGAATCA ATCTGTCCCG 3 500 

CAACGAATCC GCAAGTTTCG AGGATCTTGC GTAA 3534 
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2) INFORMATION FOR SEQ ID NO: 2073 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2073 
CCGAGCAACA TGATTGAACC ATCCACCAAC TGGCTCGG 38 



2) INFORMATION FOR SEQ ID NO: 2074 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2074 
CCGAGCAACA TGATTGAAGC TTCCACCAAC TGGCTCGG 38 



2) INFORMATION FOR SEQ ID NO: 2075 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2075 
CCGAGCCAGG TTCTGAAGTC TCTGCATTAT TAGGTGCTCG G 41 



2) INFORMATION FOR SEQ ID NO: 2076 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear ^ 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2076 
CCGAGCYGAY AACATTTTCA GATTCACCCA RGCGCTCGG 



2) INFORMATION FOR SEQ ID NO: 2077 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: • Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2077 
CCGAGCAACC GATCCAGCTC CAGCTACGCT CGG 



2) INFORMATION FOR SEQ ID NO: 2078 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2078 
CCGAGCCTTG GTCTTCGGCC AAATGAACGC TCGG 



2) INFORMATION FOR SEQ ID NO: 2079 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2079 
CCGAGCGTTC AGTTACTTCA GTCCAAGCCG GCTCGG 
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2) INFORMATION FOR SEQ ID NO: 2080 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2080 
CCGAGCCGAA GAGGGCCAAG ATGTCGCTCG G 



2) INFORMATION FOR SEQ ID NO: 2081 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2081 
GRATYRTYAA AGTTGGTGAG GAAG 



2) INFORMATION FOR SEQ ID NO: 2082 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2082 
CMACTTCATC YCGCTTCGTA CC 



2) INFORMATION FOR SEQ ID NO: 2083 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2083 
CCGCCGATGT TCCGTAAATT ACTTGAIGAA GGTCGAGCCG GCGG 



2) INFORMATION FOR SEQ ID NO: 2084 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2084 
CACGCGTCAA CACCCGTACA AGTCGTCTTT TGCGCGTG 



2) INFORMATION FOR SEQ ID NO: 2085 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2085 
CAAACTAAAG AACATATCTT GCTA 



2) INFORMATION FOR SEQ ID NO: 2086 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2086 
ATATAATTTG CATCACCTTC AAG 
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2) INFORMATION FOR SEQ ID NO: 2087 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2087 
TCAGCTCGTG GGATTAGGAG AG 22 



2) INFORMATION FOR SEQ ID NO: 2088 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2088 
AGGCTTCACG CTGTTAGGCT GA 22 



2) INFORMATION FOR SEQ ID NO: 2089 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2089 
ATGCTGAACT TATTGACCTT 20 



2) INFORMATION FOR SEQ ID NO: 2090 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2090 
CGTTACTGGA GTCGAAATG 



2) INFORMATION FOR SEQ ID NO: 2091 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: • Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2091 
CGCGACTTGA GATGGAACTT AGTGAGCTTC TTGGTCGCG 



2) INFORMATION FOR SEQ ID NO: 2092 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2092 
CGCGACGAAA GAACTTCCTG AAGGTCGTGC AGGTCCAG 



2) INFORMATION FOR SEQ ID NO: 2093 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2093 
TGTTGGCAAT CGAAGACACC 
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2) INFORMATION FOR SEQ ID NO: 2094 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2094 

TTCAATTTCT TGACCTACTT TCAA 24 
2) INFORMATION FOR SEQ ID NO: 2095 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2095 
CGGTCGGGTT GAACGTGG 18 



2) INFORMATION FOR SEQ ID NO: 2096 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2096 
CGCGACCGGT ACCACGGCCA GTAATCGTGT CGCG 34 
2) INFORMATION FOR SEQ ID NO: 2097 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Mycoplasma pneumoniae 

(B) STRAIN: ATCC 29342 

(C) ACCESSION NUMBER: AE000019 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2097 

ATGGCAAGAG AGAAATTTGA CCGATCTAAA CCCCACGTTA ATGTAGGTAC 50 

TATTGGCCAC ATTGACCACG GTAAAACAAC TTTAACAGCA GCTATTTGTA 100 

CTGTATTAGC AAAAGAAGGT AAATCAGCTG CTACTCGTTA CGACCAAATC 150 

GATAAGGCTC CGGAAGAAAA AGCACGGGGA ATTACGATTA ACTCCGCTCA 200 

CGTGGAGTAC TCCTCTGACA AGCGTCACTA TGCTCACGTT GACTGTCCAG 250 

GACACGCTGA CTACATTAAG AACATGATTA CTGGTGCTGC ACAAATGGAT 300 

GGTGCCATTC TAGTAGTTTC AGCAACTGAC AGTGTTATGC CCCAAACCCG 350 

TGAACACATT TTGTTGGCCC GCCAAGTGGG TGTGCCACGC ATGGTAGTGT 4 00 

TCCTAAACAA GTGTGACATT GCAACTGATG AAGAAGTGCA AGAGTTAGTA 450 

GCAGAAGAGG TACGTGACTT ATTAACTTCT TACGGCTTTG ATGGCAAGAA 500 

CACCCCTATT ATTTATGGTT CTGCACTTAA AGCGCTTGAA GGTGATCCTA 550 

AGTGGGAAGC TAAGATCCAT GATTTAATGA ATGCAGTTGA TGAATGGATT 600 

CCAACTCCTG AACGTGAAGT GGACAAACCC TTCTTGTTGG CAATCGAAGA 650 

CACCATGACG ATTACTGGCC GTGGTACCGT GGTTACCGGT CGGGTTGAAC 700 

GTGGTGAATT GAAAGTAGGT CAAGAAATTG AAATCGTTGG TTTACGTCCA 750 

ATCCGTAAAG CAGTTGTTAC CGGAATCGAA ATGTTCAAAA AGGAACTTGA 800 

TTCAGCAATG GCTGGGGACA ACGCTGGGGT ATTACTCCGT GGTGTGGACC 850 

GTAAAGAAGT GGAACGTGGT CAAGTGTTAG CTAAACCAGG TTCGATTAAA 900 

CCGCACAAGA AATTTAAAGC GGAAATCTAT GCTTTAAAGA AGGAAGAAGG 950 

TGGTCGTCAC ACCGGTTTCT TAAACGGTTA CCGTCCCCAA TTCTACTTCC 1000 

GTACTACAGA CGTTACTGGT TCGATTTCCC TACCAGAAAA CACCGAAATG 1050 

GTGCTACCAG GTGACAATAC CTCGATTACA GTTGAACTAA TTGCACCAAT 1100 

TGCTTGTGAA AAAGGTAGTA AGTTCTCCAT CCGTGAAGGT GGTCGAACGG 1150 

TTGGTGCTGG TTCAGTCACG GAAGTGCTTG AATAG 1185 



2) INFORMATION FOR SEQ ID NO: 2098 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2098 
CTGAGTCACA CCGACAAACG TC 



2) INFORMATION FOR SEQ ID NO: 2099 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2099 
CCAGGACTGA ACGGGATACG AA 22 



2) INFORMATION FOR SEQ ID NO: 2100 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE:, DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2100 
GCGAGACGAT AGGTTGTC 18 



2) INFORMATION FOR SEQ ID NO: 2101 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2609 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: H37Rv 

(C) ACCESSION NUMBER: Z79701 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2101 

CAGCCCGCGA GCGTAACCTG GCTGCGATTT CCGGCGCGGA TTTTCGCAGT 50 

GCGGTTACGC TCGGAAAGCG CGGGCCTCGC CCACGCGGCG GATGATGTCA 10 0 

GCGGGGTGGT CCTCGGCGAC GACCCGGACC ACGATCCACC CGTAGCGGTG 150 

CTGGACTTTC TCGTGCCGGA GGATGTCTTT CCGGTAGTGG TAGCGACTGG 20 0 

TCAGATGGTG GTCGCCGTCA TACTCGGCCG CGACCTTGAT GTCTTGCCAG 250 

CCCATATCCA AATGGGCTTC CGCCCAGCCC CATTCGTTGC GCACCGCGAT 3 00 

CTGCGTCTGG GGGCGCGGAA AGCCGGCGCG GATCAACAAC AAGCGCAGCC 3 50 

AGGTTTCCTT GGGGGACTGG GCACCGCCGT CGACGAGGTC CAGAGCGGCT 400 

CTTGCGGCCT TCATGCCACG GCGGCCCCGA TAGCGCTCGA TCAGCGGCTC 4 50 

GACGTCGGCC ACCTTCAAAT CGGTGGCCTG TATCAGGGCG TCGACGGCCG 500 

CGACGGCGGG GTCCAATGGA AATCGACTGG TCAGGTCGAG CGCCGTTCGC 550 

TCCGGTGTGG TCACGCGCAT GCCCTCGATG ACGCAGATCT CGTCGGGCTC 600 

GATGCGCTCT TCCCAGACTT GCAGCCCCGG GGCACGGCGG CGGTTGGTGT 650 

CGATGATCGC GGCGGGAAGA TCCGCGTCGA TCCACTTGGC GCCATGGAAG 700 

GCAGAAGCCG AGTAGCCGGC CAGCACGCCG CGGCGGCGCG AGCGCAGCCA 750 

CAGCGCTTTT GCACGCAATT GCGCGGTCAG TTCCACACCC TGCGGCACGT 800 
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ACACGTCTTT ATGTAGCGCG 
AATACACCCG CAGCCAGGGC 
CGAAGTGTGC TGAGTCACAC 
GAAAGTTCCC GCCGGAAATC 
TTTCGGCCCG GCCGCGGCGA 
CCACTGAAGG GGCCAAACCC 
GGAGGAAACC GGGGGATCGG 
CGGCCACAAG GTGGCCGTCA 
TGTTTGGCGT CGAATGTGAC 
TTCACGGCGG TAGAAGAGCA 
CGCCGGCCTA TCCGCGGACG 
TCGAGAAGGT CATCAACGCC 
CGGGCATCGC GCAGCATGCA 
AGGTTCGGTC TCCGGCAGCT 
CCTCCAAGGC CGGAGTGATT 
TCGAAGGCAA ACGTGACCGC 
CGATATGACC CGCGCGCTGG 
TTATCCCAGC GAAGCGGGTC 
AGCTTCCTGG CTTCCGAGGA 
GGTCGACGGC GGCATGGGTA 
TGACAGGACT GCTGGACGGC 
GACTCGTCGA TCGCGTTTCA 
CCAGCTGGTG CTCACCGGGT 
CCGACCGGCT GCCGGCAAAG 
GAGGAGCACC TGGCCAGCTT 
GGGCAACAAG CTCGACGGGG 
CCGGGATGGG CATCAACCCG 
AAGGGCATCC ACATCTCGGC 
GCTGCCGATC ATGAACCCCG 
CGAGCCGGGC GATGCCGGCC 
TTGGAGTCGG TCAACAGGTT 
GCGTTCGAAT CTCGTTGCCG 
CGATCGTCGG CGGTGCGCTC 
CTCGAGGAGG GCTGGGATCA 
TGCGACGCCG GTCGCCAAGA 
CGGCGACCAC GGGTGACATC 
TTGCTCTAG 



ACATACCTGC TGCGCAATTC 
CTCGCTGCCC AGAAAGGGAT 
CGACAAACGT CACGAGCGTA 
GCAGCCACGT TACGCTCGTG 
GACGATAGGT TGTCGGGGTG 
CCATTCGTAT CCCGTTCAGT 
GCTGGCGATC GCACAGCGGC 
CCCACCGTGG ATCCGGAGCG 
GTCACCGACA GCGACGCCGT 
CCAGGGTCCG GTCGAGGTGC 
CATTCCTCAT GCGGATGACC 
AACCTCACCG GGGCGTTCCG 
GCGCAACAAA TTCGGTCGAA 
GGGGCATCGG, CAACCAGGCC 
GGCATGGCCC GCTCGATCGC 
GAATGTGGTG GCCCCGGGCT 
ATGAGCGGAT TCAGCAGGGG 
GGCACCCCCG CCGAGGTCGC 
TGCGAGCTAT ATCTCCGGTG 
TGGGCCACTG ACACAACACA 
AAACGGATTC TGGTTAGCGG 
CATCGCACGG GTAGCCCAGG 
TCGACCGGCT GCGGCTGATT 
GCCCCGCTGC TCGAACTCGA 
GGCCGGCCGG GTGACCGAGG 
TGGTGCATTC GATTGGGTTC 
TTCTTCGACG CGCCCTACGC 
GTATTCGTAT GCTTCGATGG 
GAGGTTCCAT CGTCGGCATG 
TACAACTGGA TGACGGTCGC 
CGTGGCGCGC GAGGCCGGCA 
CAGGCCCTAT CCGGACGCTG 
GGCGAGGAGG CCGGCGCCCA 
GCGCGCTCCG ATCGGCTGGA 
CGGTGTGCGC GCTGCTGTCT 
ATCTACGCCG ACGGCGGCGC 



GTAGGGCGTC 850 

CCGTCATGGT 900 

ACCCCAGTGC 950 

GACATACCGA 1000 

ACTGCCACAG 1050 

CCTGGTTACC 1100 

TGGCTGCCGA 1150 

CCAAAGGGGC 1200 

CGATCGCGCC 1250 

TGGTGTCCAA 1300 

GAGGAAAAGT 1350 

GGTGGCTCAA 1400 

TGATATTCAT 1450 

AACTACGCAG 1500 

CCGCGAGCTG 1550 

ACATCGACAC 1600 

GCGCTGCAAT 1650 

CGGGGTGGTC 1700 

CGGTCATCCC 1750 

AGGACGCACA 1800 

AATCATCACC 1850 

AGCAGGGCGC 1900 

CAGCGCATCA 1950 

CGTGCAAAAC 2000 

CGATCGGGGC 2050 

ATGCCGCAGA 2100 

GGATGTGTCC 2150 

CCAAGGCGCT 2200 

GACTTCGACC 22 50 

CAAGAGCGCG 2300 

AGTACGGTGT 23 50 

GCGATGAGTG 2400 

GATCCAGCTG 2450 

ACATGAAGGA 2500 

GACTGGCTGC 2550 

GCACACCCAA 2600 
2609 



2) INFORMATION FOR SEQ ID NO: 2102 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2102 
CACCTTCACC CTGACCGACG 



2) INFORMATION FOR SEQ ID NO: 2103 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2103 
CGAACCAGCG GAAATAGTTG GAC 23 



2) INFORMATION FOR SEQ ID NO: 2104 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2104 
CTGGGCATGG CICGAGTC 18 



2) INFORMATION FOR SEQ ID NO: 2105 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3297 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: H37rv 

(C) ACCESSION NUMBER: U68480 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2105 

ATGACACAGT GCGCGAGCAG ACGCAAAAGC ACCCCAAATC GGGCGATTTT 50 

GGGGGCTTTT GCGTCTGCTC GCGGGACGCG CTGGGTGGCC ACCATCGCCG 100 

GGCTGATTGG CTTTGTGTTG TCGGTGGCGA CGCCGCTGCT GCCCGTCGTG 150 

CAGACCACCG CGATGCTCGA CTGGCCACAG CGGGGGCAAC TGGGCAGCGT 200 

GACCGCCCCG CTGATCTCGC TGACGCCGGT CGACTTTACC GCCACCGTGC 250 

CGTGCGACGT GGTGCGCGCC ATGCCACCCG CGGGCGGGGT GGTGCTGGGC 300 

ACCGCACCCA AGCAAGGCAA GGACGCCAAT TTGCAGGCGT TGTTCGTCGT 3 50 

CGTCAGCGCC CAGCGCGTGG ACGTCACCGA CCGCAACGTG GTGATCTTGT 4 00 

CCGTGCCGCG CGAGCAGGTG ACGTCCCCGC AGTGTCAACG CATCGAGGTC 4 50 
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ACCTCTACCC ACGCCGGCAC CTTCGCCAAC TTCGTCGGGC TCAAGGACCC 500 

GTCGGGCGCG CCGCTGCGCA GCGGCTTCCC CGACCCCAAC CTGCGCCCGC 550 

AGATTGTCGG GGTGTTCACC GACCTGACCG GGCCCGCGCC GCCCGGGCTG 600 

GCGGTCTCGG CGACCATCGA CACCCGGTTC TCCACCCGGC CGACCACGCT 650 

GAAACTGCTG GCGATCATCG GGGCGATCGT GGCCACCGTC GTCGCACTGA 700 

TCGCGTTGTG GCGCCTGGAC CAGTTGGACG GGCGGGGCTC AATTGCCCAG 750 

CTCCTCCTCA GGCCGTTCCG GCCTGCATCG TCGCCGGGCG GCATGCGCCG 800 

GCTGATTCCG GCAAGCTGGC GCACCTTCAC CCTGACCGAC GCCGTGGTGA 850 

TATTCGGCTT CCTGCTCTGG CATGTCATCG GCGCGAATTC GTCGGACGAC 900 

GGCTACATCC TGGGCATGGC CCGAGTCGCC GACCACGCCG GCTACATGTC 950 

CAACTATTTC CGCTGGTTCG GCAGCCCGGA GGATCCCTTC GGCTGGTATT 1000 

ACAACCTGCT GGCGCTGATG ACCCATGTCA GCGACGCCAG TCTGTGGATG 1050 

CGCCTGCCAG ACCTGGCCGC CGGGCTAGTG TGCTGGCTGC TGCTGTCGCG 1100 

TGAGGTGCTG CCCCGCCTCG GGCCGGCGGT, GGAGGCCAGC AAACCCGCCT 1150 

ACTGGGCGGC GGCCATGGTC TTGCTGACCG CGTGGATGCC GTTCAACAAC 1200 

GGCCTGCGGC CGGAGGGCAT CATCGCGCTC GGCTCGCTGG TCACCTATGT 1250 

GCTGATCGAG CGGTCCATGC GGTACAGCCG GCTCACACCG GCGGCGCTGG 13 00 

CCGTCGTTAC CGCCGCATTC ACACTGGGTG TGCAGCCCAC CGGCCTGATC 13 50 

GCGGTGGCCG CGCTGGTGGC CGGCGGCCGC CCGATGCTGC GGATCTTGGT 14 00 

GCGCCGYCAT CGCCTGGTCG GCACGTTGCC GTTGGTGTCG CCGATGCTGG 1450 

CCGCCGGCAC CGTCATCCTG ACCGTGGTGT TCGCCGACCA GACCCTGTCA 1500 

ACGGTGTTGG AAGCCACCAG GGTTCGCGCC AAAATCGGGC CGAGCCAGGC 1550 

GTGGTATACC GAGAACCTGC GTTACTACTA CCTCATCCTG CCCACCGTCG 1600 

ACGGTTCGCT GTCGCGGCGC TTCGGCTTTT TGATCACCGC GCTATGCCTG 1650 

TTCACCGCGG TGTTCATCAT GTTGCGGCGC AAGCGAATTC CCAGCGTGGC 1700 

CCGCGGACCG GCGTGGCGGC TGATGGGCGT CATCTTCGGC ACCATGTTCT 1750 

TCCTGATGTT CACGCCCACC AAGTGGGTGC ACCACTTCGG GCTGTTCGCC 1800 

GCCGTAGGGG CGGCGATGGC CGCGCTGACG ACGGTGTTGG TATCCCCATC 1850 

GGTGCTGCGC TGGTCGCGCA ACCGGATGGC GTTCCTGGCG GCGTTATTCT 1900 

TCCTGCTGGC GTTGTGTTGG GCCACCACCA ACGGCTGGTG GTATGTCTCC 1950 

AGCTACGGTG TGCCGTTCAA CAGCGCGATG CCGAAGATCG ACGGGATCAC 2000 

AGTCAGCACA ATCTTTTTCG CCCTGTTTGC GATCGCCGCC GGCTATGCGG 2050 

CCTGGCTGCA CTTCGCGCCC CGCGGCGCCG GCGAAGGGCG GCTGATCCGC 2100 

GCGCTGACGA CAGCCCCGGT ACCGATCGTG GCCGGTTTCA TGGCGGCGGT 2150 

GTTCGTCGCG TCCATGGTGG CCGGGATCGT GCGACAGTAC CCGACCTACT 2200 

CCAACGGCTG GTCCAACGTG CGGGCGTTTG TCGGCGGCTG CGGACTGGCC 2250 

GACGACGTAC TCGTCGAGCC TGATACCAAT GCGGGTTTCA TGAAGCCGCT 2300 

GGACGGCGAT TCGGGTTCTT GGGGCCCCTT GGGCCCGCTG GGTGGAGTCA 2350 

ACCCGGTCGG CTTCACGCCC AACGGCGTAC CGGAACACAC GGTGGCCGAG 24 00 

GCGATCGTGA TGAAACCCAA CCAGCCCGGC ACCGACTACG ACTGGGATGC 2450 

GCCGACCAAG CTGACGAGTC CTGGCATCAA TGGTTCTACG GTGCCGCTGC 2500 

CCTATGGGCT CGATCCCGCC CGGGTACCGT TGGCAGGCAC CTACACCACC 2550 

GGCGCACAGC AACAGAGCAC ACTCGTCTCG GCGTGGTATC TCCTGCCTAA 2600 

GCCGGACGAC GGGCATCCGC TGGTCGTGGT GACCGCCGCG GGCAAGATCG 2 650 

CCGGCAACAG CGTGCTGCAC GGGTACACCC CCGGGCAGAC TGTGGTGCTC 2700 

GAATACGCCA TGCCGGGACC CGGAGCGCTG GTACCCGCCG GGCGGATGGT 2750 

GCCCGACGAC CTATACGGAG AGCAGCCCAA GGCGTGGCGC AACCTGCGCT 2800 

TCGCCCGAGC AAAGATGCCC GCCGATGCCG TCGCGGTCCG GGTGGTGGCC 2850 

GAGGATCTGT CGCTGACACC GGAGGACTGG ATCGCGGTGA CCCCGCCGCG 2900 

GGTACCGGAC CTGCGCTCAC TGCAGGAATA TGTGGGCTCG ACGCAGCCGG 2950 

TGCTGCTGGA CTGGGCGGTC GGTTTGGCCT TCCCGTGCCA GCAGCCGATG 3000 

CTGCACGCCA ATGGCATCGC CGAAATCCCG AAGTTCCGCA TCACACCGGA 3050 

CTACTCGGCT AAGAAGCTGG ACACCGACAC GTGGGAAGAC GGCACTAACG 3100 

GCGGCCTGCT CGGGATCACC GACCTGTTGC TGCGGGCCCA CGTCATGGCC 3150 

ACCTACCTGT CCCGCGACTG GGCCCGCGAT TGGGGTTCCC TGCGCAAGTT 3200 

CGACACCCTG GTCGATGCCC CTCCCGCCCA GCTCGAGTTG GGCACCGCGA 3250 

CCCGCAGCGG CCTGTGGTCA CCGGGCAAGA TCCGAATTGG TCCATAG 3297 
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2) INFORMATION FOR SEQ ID NO: 2106 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2106 

GCGAGCAGAG CACGCCCTCC TCGCCGCTCG C 31 
2) INFORMATION FOR SEQ ID NO: 2107 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2107 
GCGAGCTCCC CATCTCTGGT TGGCACGCTC GC 32 



2) INFORMATION FOR SEQ ID NO: 2108 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2108 
GCGGGCAACT TCRTCAAGAA GGTTGGTTAC AACCCGCCCG C 41 



2) INFORMATION FOR SEQ ID NO: 2109 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2109 
GCGGGCCCAA TCTCTGGTTG GAAYGGTGAC AAGCCCGC 

2) INFORMATION FOR SEQ ID NO: 2110 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2110 
GCGGGCCCTT AACGATTTCA GCGAATCTGG ATTCAGCCCG C 41 



PCT/CA00/01150 



38 



2) INFORMATION FOR SEQ ID NO: 2111 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2111 
GCGGGCATGA TTGAAGCCAC CACCAACGCT TCCTGGCCCG C 41 



2) INFORMATION FOR SEQ ID NO: 2112 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2112 
GCGGGCTTGA TGAAGTTTTG GGTTTCCTTG ACAATTTGCC CGC 4 3 



2) INFORMATION FOR 



SEQ ID NO: 2113 
1089 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2113 
GCGGGCACAA GGGTTGGACT AAGGAAACCA AGGCAGCCCG C 41 



2) INFORMATION FOR SEQ ID NO: 2114 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2114 
GCGGGCATCG ATGCTATTGA ACCACCTGTC AGACCGCCCG C 41 
2) INFORMATION FOR SEQ ID NO: 2115 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2115 
GCGGGCTTGA TGATTTCCTC GAATCTAGAT TGGGCCCGC 39 



2) INFORMATION FOR SEQ ID NO: 2116 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2116 

GCGGGCGGTA AGTCCACCGG TAAGACCTTG TTGGCCCGC 39 
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2) INFORMATION FOR SEQ ID NO: 2117 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 2117 
GCGGGCGACG CCATTGAGCC ACCTTCGAGA GCCCGC 36 



2) INFORMATION FOR SEQ ID NO: 2118 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2118 
GCGGGCTCCT TGACAATTTC TTCGTATCTG TTCTTGGCCC GC 42 



2) INFORMATION FOR SEQ ID NO: 2119 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2119 
GCGGGCTTAC AACCCTAAGG CTGTTCCATT CGTTGCCCGC 40 



2) INFORMATION FOR SEQ ID NO: 2120 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2120 
GCGGGCAGAA ACCAAGGCTG GTAAGGTTAC CGGTAGCCCG C 41 



2) INFORMATION FOR SEQ ID NO: 2121 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDN.ESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2121 
CGACCGCIAG CCGCACACCA AGTTCCGGTC G 31 
2) INFORMATION FOR SEQ ID NO: 2122 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2122 
CCGAGCGAAT GTAGGAGTCC AGGGTCTCTG CTCGG 35 



2) INFORMATION FOR SEQ ID NO: 2123 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2123 
TCTACCACTG AAGCATTAC 19 
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2) INFORMATION FOR SEQ ID NO: 2124 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 2124 
TAGGTACTGT AGGTTTATTG 20 



2) INFORMATION FOR SEQ ID NO: 2125 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2125 
CACGCGGATT TTGAATCTCT TCCTCTAGTA GCGCGTG 37 



2) INFORMATION FOR SEQ ID NO: 2126 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2126 
ATATCAGAGA CTGATGAG 18 



2) INFORMATION FOR SEQ ID NO: 2127 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: 'Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2127 
TAGCATATTC AGAGAATATT GT 



2) INFORMATION FOR SEQ ID NO: 2128 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY:, Linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2128 
CAACGCTGGA GAATCTATAT TTGTAGAAAC TGCGTTG 



2) INFORMATION FOR SEQ ID NO: 2129 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8133 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: VPI 10463 

(C) ACCESSION NUMBER: X51797 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2129 



ATGTCTTTAA TATCTAAAGA 
ACCAAGAGAA AATGAGTATA 
ATAAGTTAAC TACAAACAAT 
CTAAATGAAT CAATTGATGT 
AAATAGAGCA CTCTCTAATC 
TTATTAAAAA TTCCAATACA 
TGGATAGGTG GAGAAGTCAG 
GGCTGATATT AATGCAGAAT 
CATTCTTAGT AAATACACTA 
GAAGCATTAC AGCTACTAGA 
TATGAAATTT TACAAAAAAA 
GGTTTATAAA TTATTATAAA 
ATAGATGATA TTATAAAGTC 
AACTGTATTA GAATCATATA 
ATCATGGGAT AGATATCAGG 
TTAAATATTT ATAGTCAGGA 



AGAGTTAATA AAACTCGCAT 
AAACTATACT AACTAATTTA 
AATGAAAATA AATATTTGCA 
TTTTATGAAT AAATATAAAA 
TAAAAAAAGA TATATTAAAA 
AGCCCTGTAG AAAAAAATTT 
TGATATTGCT CTTGAATACA 
ATAATATTAA ACTGTGGTAT 
AAAAAGGCTA TAGTTGAATC 
GGAAGAGATT CAAAATCCTC 
GGATGGAATT TATATATGAT 
TCTCAAATCA ATAAACCTAC 
TCATCTAGTA TCTGAATATA 
GAACAAATTC TTTGAGAAAA 
GCTAATAGTT TGTTTACAGA 
GTTGTTAAAT CGTGGAAATT 
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ATAGCATTAG 50 

GACGAATATA 100 

ATTAAAAAAA 150 

CTTCAAGCAG 2 00 

GAAGTAATTC 250 

ACATTTTGTA 300 

TAAAACAATG 350 

GATAGTGAAG 4 00 

TTCTACCACT 4 50 

AATTTGATAA 500 

AGACAAAAAA 550 

AGTACCTACA 600 

ATAGAGATGA 650 

ATAAATAGTA 700 

ACAAGAGTTA 750 

TAGCTGCAGC 800 
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ATCTGACATA GTAAGATTAT TAGCCCTAAA AAATTTTGGC GGAGTATATT v 850 
TAGATGTTGA TATGCTTCCA GGTATTCACT CTGATTTATT TAAAACAATA 900 
TCTAGACCTA GCTCTATTGG ACTAGACCGT TGGGAAATGA TAAAATTAGA 950 

GGCTATTATG AAGTATAAAA AATATATAAA TAATTATACA TCAGAAAACT 1000 

TTGATAAACT TGATCAACAA TTAAAAGATA ATTTTAAACT CATTATAGAA 1050 

AGTAAAAGTG AAAAATCTGA GATATTTTCT AAATTAGAAA ATTTAAATGT 1100 

ATCTGATCTT GAAATTAAAA TAGCTTTCGC TTTAGGCAGT GTTATAAATC 1150 

AAGCCTTGAT ATCAAAACAA GGTTCATATC TTACTAACCT AGTAATAGAA 1200 

CAAGTAAAAA ATAGATATCA ATTTTTAAAC CAACACCTTA ACCCAGCCAT 1250 

AGAGTCTGAT AATAACTTCA CAGATACTAC TAAAATTTTT CATGATTCAT 13 00 

TATTTAATTC AGCTACCGCA GAAAACTCTA TGTTTTTAAC AAAAATAGCA 1350 

CCATACTTAC AAGTAGGTTT TATGCCAGAA GCTCGCTCCA CAATAAGTTT 14 00 

AAGTGGTCCA GGAGCTTATG CGTCAGCTTA CTATGATTTC ATAAATTTAC 14 50 

AAGAAAATAC TATAGAAAAA ACTTTAAAAG CATCAGATTT AATAGAATTT 1500 

AAATTCCCAG AAAATAATCT ATCTCAATTG ACAGAACAAG AAATAAATAG 1550 

TCTATGGAGC TTTGATCAAG CAAGTGCAAA ATATCAATTT GAGAAATATG 1600 

TAAGAGATTA TACTGGTGGA TCTCTTTCTG AAGACAATGG GGTAGACTTT 1650 

AATAAAAATA CTGCCCTCGA CAAAAACTAT TTATTAAATA ATAAAATTCC 1700 

ATCAAACAAT GTAGAAGAAG CTGGAAGTAA AAATTATGTT CATTATATCA 1750 

TACAGTTACA AGGAGATGAT ATAAGTTATG AAGCAACATG CAATTTATTT 1800 

TCTAAAAATC CTAAAAATAG TATTATTATA CAACGAAATA TGAATGAAAG 1850 

TGCAAAAAGC TACTTTTTAA GTGATGATGG AGAATCTATT TTAGAATTAA 1900 

ATAAATATAG GATACCTGAA AGATTAAAAA ATAAGGAAAA AGTAAAAGTA 1950 

ACCTTTATTG GACATGGTAA AGATGAATTC AACACAAGCG AATTTGCTAG 2000 

ATTAAGTGTA GATTCACTTT CCAATGAGAT AAGTTCATTT TTAGATACCA 2050 

TAAAATTAGA TATATCACCT AAAAATGTAG AAGTAAACTT ACTTGGATGT 2100 

AATATGTTTA GTTATGATTT TAATGTTGAA GAAACTTATC CTGGGAAGTT 2150 

GCTATTAAGT ATTATGGACA AAATTACTTC CACTTTACCT GATGTAAATA 2200 

AAAATTCTAT TACTATAGGA GCAAATCAAT ATGAAGTAAG AATTAATAGT 2250 

GAGGGAAGAA AAGAACTTCT GGCTCACTCA GGTAAATGGA TAAATAAAGA 2300 

AGAAGCTATT ATGAGCGATT TATCTAGTAA AGAATACATT TTTTTTGATT 23 50 

CTATAGATAA TAAGCTAAAA GCAAAGTCCA AGAATATTCC AGGATTAGCA 24 00 

TCAATATCAG AAGATATAAA AACATTATTA CTTGATGCAA GTGTTAGTCC 24 50 

TGATACAAAA TTTATTTTAA ATAATCTTAA GCTTAATATT GAATCTTCTA 2500 

TTGGGGATTA CATTTATTAT GAAAAATTAG AGCCTGTTAA AAATATAATT 2 550 

CACAATTCTA TAGATGATTT AATAGATGAG TTCAATCTAC TTGAAAATGT 2 600 

ATCTGATGAA TTATATGAAT TAAAAAAATT AAATAATCTA GATGAGAAGT 2 650 

ATTTAATATC TTTTGAAGAT ATCTCAAAAA ATAATTCAAC TTACTCTGTA 2 700 

AGATTTATTA ACAAAAGTAA TGGTGAGTCA GTTTATGTAG AAACAGAAAA 2 750 

AGAAATTTTT TCAAAATATA GCGAACATAT TACAAAAGAA ATAAGTACTA 28 00 

TAAAGAATAG TATAATTACA GATGTTAATG GTAATTTATT GGATAATATA 2850 

CAGTTAGATC ATACTTCTCA AGTTAATACA TTAAACGCAG CATTCTTTAT 2900 

TCAATCATTA ATAGATTATA GTAGCAATAA AGATGTACTG AATGATTTAA 2950 

GTACCTCAGT TAAGGTTCAA CTTTATGCTC AACTATTTAG TACAGGTTTA 3000 

AATACTATAT ATGACTCTAT CCAATTAGTA AATTTAATAT CAAATGCAGT 3 050 

AAATGATACT ATAAATGTAC TACCTACAAT AACAGAGGGG ATACCTATTG 3100 

TATCTACTAT ATTAGACGGA ATAAACTTAG GTGCAGCAAT TAAGGAATTA 3150 

CTAGACGAAC ATGACCCATT ACTAAAAAAA GAATTAGAAG CTAAGGTGGG 3 200 

TGTTTTAGCA ATAAATATGT CATTATCTAT AGCTGCAACT GTAGCTTCAA 3250 

TTGTTGGAAT AGGTGCTGAA GTTACTATTT TCTTATTACC TATAGCTGGT 3300 

ATATCTGCAG GAATACCTTC ATTAGTTAAT AATGAATTAA TATTGCATGA 3350 

TAAGGCAACT TCAGTGGTAA ACTATTTTAA TCATTTGTCT GAATCTAAAA 34 00 

AATATGGCCC TCTTAAAACA GAAGATGATA AAATTTTAGT TCCTATTGAT 3450 

GATTTAGTAA TATCAGAAAT AGATTTTAAT AATAATTCGA TAAAACTAGG 3500 

AACATGTAAT ATATTAGCAA TGGAGGGGGG ATCAGGACAC ACAGTGACTG 3550 

GTAATATAGA TCACTTTTTC TCATCTCCAT CTATAAGTTC TCATATTCCT 3600 

TCATTATCAA TTTATTCTGC AATAGGTATA GAAACAGAAA ATCTAGATTT 3 650 
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TTCAAAAAAA ATAATGATGT TACCTAATGC TCCTTCAAGA GTGTTTTGGT ^700 

GGGAAACTGG AGCAGTTCCA GGTTTAAGAT CATTGGAAAA TGACGGAACT 3750 

AGATTACTTG ATTCAATAAG AGATTTATAC CCAGGTAAAT TTTACTGGAG 3800 

ATTCTATGCT TTTTTCGATT ATGCAATAAC TACATTAAAA CCAGTTTATG 3850 

AAGACACTAA TATTAAAATT AAACTAGATA AAGATACTAG AAACTTCATA 3900 

ATGCCAACTA TAACTACTAA CGAAATTAGA AACAAATTAT CTTATTCATT 3950 

TGATGGAGCA GGAGGAACTT ACTCTTTATT ATTATCTTCA TATCCAATAT 4000 

CAACGAATAT AAATTTATCT AAAGATGATT TATGGATATT TAATATTGAT 4050 

AATGAAGTAA GAGAAATATC TATAGAAAAT GGTACTATTA AAAAAGGAAA 4100 

GTTAATAAAA GATGTTTTAA GTAAAATTGA TATAAATAAA AATAAACTTA 4150 

TTATAGGCAA TCAAACAATA GATTTTTCAG GCGATATAGA TAATAAAGAT 4200 

AGATATATAT TCTTGACTTG TGAGTTAGAT GATAAAATTA GTTTAATAAT 4250 

AGAAATAAAT CTTGTTGCAA AATCTTATAG TTTGTTATTG TCTGGGGATA 43 00 

AAAATTATTT GATATCCAAT TTATCTAATA CTATTGAGAA AATCAATACT 4350 

TTAGGCCTAG ATAGTAAAAA TATAGCGTAC AATTACACTG ATGAATCTAA 44 00 

TAATAAATAT TTTGGAGCTA TATCTAAAAC AAGTCAAAAA AGCATAATAC 44 50 

ATTATAAAAA AGACAGTAAA AATATATTAG AATTTTATAA TGACAGTACA 4 500 

TTAGAATTTA ACAGTAAAGA TTTTATTGCT GAAGATATAA ATGTATTTAT 4550 

GAAAGATGAT ATTAATACTA TAACAGGAAA ATACTATGTT GATAATAATA 5000 

CTGATAAAAG TATAGATTTC TCTATTTCTT TAGTTAGTAA AAATCAAGTA 5050 

AAAGTAAATG GATTATATTT AAATGAATCC GTATACTCAT CTTACCTTGA 5100 

TTTTGTGAAA AATTCAGATG GACACCATAA TACTTCTAAT TTTATGAATT 5150 

TATTTTTGGA CAATATAAGT TTCTGGAAAT TGTTTGGGTT TGAAAATATA 5200 

AATTTTGTAA TCGATAAATA CTTTACCCTT GTTGGTAAAA CTAATCTTGG 5250 

ATATGTAGAA TTTATTTGTG ACAATAATAA AAATATAGAT ATATATTTTG 5300 

GTGAATGGAA AACATCGTCA TCTAAAAGCA CTATATTTAG CGGAAATGGT 53 50 

AGAAATGTTG TAGTAGAGCC TATATATAAT CCTGATACGG GTGAAGATAT 54 00 

ATCTACTTCA CTAGATTTTT CCTATGAACC TCTCTATGGA ATAGATAGAT 54 50 

ATATAAATAA AGTATTGATA GCACCTGATT TATATACAAG TTTAATAAAT 5500 

ATTAATACCA ATTATTATTC AAATGAGTAC TACCCTGAGA TTATAGTTCT 5550 

TAACCCAAAT ACATTCCACA AAAAAGTAAA TATAAATTTA GATAGTTCTT 56 0 0 

CTTTTGAGTA TAAATGGTCT ACAGAAGGAA GTGACTTTAT TTTAGTTAGA 5650 

TACTTAGAAG AAAGTAATAA AAAAATATTA CAAAAAATAA GAATCAAAGG 570 0 

TATCTTATCT AATACTCAAT CATTTAATAA AATGAGTATA GATTTTAAAG 5750 

ATATTAAAAA ACTATCATTA GGATATATAA TGAGTAATTT TAAATCATTT 58 00 

AATTCTGAAA ATGAATTAGA TAGAGATCAT TTAGGATTTA AAATAATAGA 58 50 

TAATAAAACT TATTACTATG ATGAAGATAG TAAATTAGTT AAAGGATTAA 5900 

GCAAGTAATT GACAATAATA AGTATTATTT CAATCCTGAC ACTGCTATCA 5950 

TCTCAAAAGG TTGGCAGACT GTTAATGGTA GTAGATACTA CTTTGATACT 6000 

GATACCGCTA TTGCCTTTAA TGGTTATAAA ACTATTGATG GTAAACACTT 6050 

TTATTTTGAT AGTGATTGTG TAGTGAAAAT AGGTGTGTTT AGTACCTCTA 6100 

ATGGATTTGA ATATTTTGCA CCTGCTAATA CTTATAATAA TAACATAGAA 6150 

GGTCAGGCTA TAGTTTATCA AAGTAAATTC TTAACTTTGA ATGGTAAAAA 62 00 

ATATTACTTT GATAATAACT CAAAAGCAGT TACCGGATTG CAAACTATTG 6250 

ATAGTAAAAA ATATTACTTT AATACTAACA CTGCTGAAGC AGCTACTGGA 6300 

TGGCAAACTA TTGATGGTAA AAAATATTAC TTTAATACTA ACACTGCTGA 6350 

AGCAGCTACT GGATGGCAAA CTATTGATGG TAAAAAATAT TACTTTAATA 64 00 

CTAACACTGC TATAGCTTCA ACTGGTTATA CAATTATTAA TGGTAAACAT 6450 

TTTTATTTTA ATACTGATGG TATTATGCAG ATAGGAGTGT TTAAAGGACC 6500 

TAATGGATTT GAATATTTTG CACCTGCTAA TACGGATGCT AACAACATAG 6550 

AAGGTCAAGC TATACTTTAC CAAAATGAAT TCTTAACTTT GAATGGTAAA 6600 

AAATATTACT TTGGTAGTGA CTCAAAAGCA GTTACTGGAT GGAGAATTAT 6650 

TAACAATAAG AAATATTACT TTAATCCTAA TAATGCTATT GCTGCAATTC 6700 

ATCTATGCAC TATAAATAAT GACAAGTATT ACTTTAGTTA TGATGGAATT 6750 

CTTCAAAATG GATATATTAC TATTGAAAGA AATAATTTCT ATTTTGATGC 6800 

TAATAATGAA TCTAAAATGG TAACAGGAGT ATTTAAAGGA CCTAATGGAT 6850 

TTGAGTATTT TGCACCTGCT AATACTCACA ATAATAACAT AGAAGGTCAG 6 900 
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GCTATAGTTT ACCAGAACAA ATTCTTAACT 
TTTTGATAAT GACTCAAAAG CAGTTACTGG 
AAAAATATTA CTTTAATCTT AACACTGCTG 
ACTATTGATG GTAAAAAATA TTACTTTAAT 
TACTGGATGG CAAACTATTG ATGGTAAAAA 
CTTTCATAGC CTCAACTGGT TATACAAGTA 
TTTAATACTG ATGGTATTAT GCAGATAGGA 
ATTTGAATAC TTTGCACCTG CTAATACGGA 
AAGCTATACT TTACCAAAAT AAATTCTTAA 
TACTTTGGTA GTGACTCAAA AGCAGTTACC 
TAAAAAATAT TACTTTAATA CTAACACTGC 
AAACTATTAA TGGTAAAAAA TACTACTTTA 
TCAACTGGTT ATACAATTAT TAGTGGTAAA 
TGGTATTATG CAGATAGGAG TGTTTAAAGG 
TTGCACCTGC TAATACAGAT GCTAACAATA 
TATCAAAATA GATTCCTATA TTTACATGAC 
TAATTCAAAA GCGGCTACTG GTTGGGTAAC 
ACTTCGAGCC TAATACAGCT ATGGGTGCGA 
AATAAAAATT TTTACTTTAG AAATGGTTTA 
AGGGTCTAAT GGATTTGAAT ACTTTGCACC 
ATATAGAAGG TCAAGCTATA CGTTATCAAA 
GGAAAAATAT ATTACTTTGG TAATAATTCA 
AACTATTAAT GGTAAAGTAT ATTACTTTAT 
CAGCTGGTGG ACTTTTCGAG ATTGATGGTG 
GATGGAGTAA AAGCCCCTGG GATATATGGC 



PCT/CA00/01150 

TTGAATGGCA AAAAATATTA 6*950 

ATGGCAAACC ATTGATGGTA 7000 

AAGCAGCTAC TGGATGGCAA 7050 

CTTAACACTG CTGAAGCAGC 7100 

ATATTACTTT AATACTAACA 7150 

TTAATGGTAA ACATTTTTAT 7200 

GTGTTTAAAG GACCTAATGG 7250 

TGCTAACAAC ATAGAAGGTC 7300 

CTTTGAATGG TAAAAAATAT 7350 

GGACTGCGAA CTATTGATGG 7400 

TGTTGCAGTT ACTGGATGGC 74 50 

ATACTAACAC TTCTATAGCT 7500 

CATTTTTATT TTAATACTGA 7550 

ACCTGATGGA TTTGAATACT 7600 

TAGAAGGTCA AGCTATACGT 7650 

AATATATATT ATTTTGGTAA 7700 

TATTGATGGT AATAGATATT 7750 

ATGGTTATAA AACTATTGAT 7800 

CCTCAGATAG GAGTGTTTAA 7850 

TGCTAATACG GATGCTAACA 7 900 

ATAGATTCCT ACATTTACTT 7950 

AAAGCAGTTA CTGGATGGCA 8 000 

GCCTGATACT GCTATGGCTG 8050 

TTATATATTT CTTTGGTGTT 8100 

TAA 8133 



2) INFORMATION FOR SEQ ID NO: 2130 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: VPI 10463 

(C) ACCESSION NUMBER: X53138 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2130 

ATGAGTTTAG TTAATAGAAA ACAGTTAGAA AAAATGGCAA ATGTAAGATT 50 

TCGTACTCAA GAAGATGAAT ATGTTGCAAT ATTGGATGCT TTAGAAGAAT 100 

ATCATAATAT GTCAGAGAAT ACTGTAGTCG AAAAATATTT AAAATTAAAA 150 

GATATAAATA GTTTAACAGA TATTTATATA GATACATATA AAAAATCTGG 200 

TAGAAATAAA GCCTTAAAAA AATTTAAGGA ATATCTAGTT ACAGAAGTAT 250 

TAGAGCTAAA GAATAATAAT TTAACTCCAG TTGAGAAAAA TTTACATTTT 300 

GTTTGGATTG GAGGTCAAAT AAATGACACT GCTATTAATT ATATAAATCA 3 50 

ATGGAAAGAT GTAAATAGTG ATTATAATGT TAATGTTTTT TATGATAGTA 4 00 

ATGCATTTTT GATAAACACA TTGAAAAAAA CTGTAGTAGA ATCAGCAATA 4 50 

AATGATACAC TTGAATCATT TAGAGAAAAC TTAAATGACC CTAGATTTGA 500 

CTATAATAAA TTCTTCAGAA AACGTATGGA AATAATTTAT GATAAACAGA 550 

AAAATTTCAT AAACTACTAT AAAGCTCAAA GAGAAGAAAA TCCTGAACTT 600 
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ATAATTGATG ATATTGTAAA 
GATAGATGAA CTTAATACCT 
AGAATAGTGG AAATGATGTT 
TCATTCAACT TATATGAACA 
TGCTTCTGAC ATATTAAGAA 
ATTTAGATGT TGATATGTTA 
ATAGAGAAAC CTAGTTCAGT 
AGAAGCTATA ATGAAATACA 
ATTTTGACAT GTTAGACGAA 
GCTTCTAAGT CAGATAAATC 
GGCATCACCA CTAGAAGTTA 
ATCAAGGGCT AATTTCTGTG 
AAACAAATCG AGAATAGATA 
TATTAGCGAG GATAATGATT 
GTATAATGGC TGAAGCTAAT 
GGAAAGTATT TAAGAGTTGG 
CTTAAGTGGC CCTGAAGCAT 
TTAAAGAAGG CAGTATGAAT 
TTTGAAATCT CTAAAACTAA 
TAGCTTATGG TCATTTGACG 
ATAAAAGGAA TTATTTTGAA 
TTTTCTCAAA ATATAGTAGT 
TTCATTAGCA AGAAGTTCAG 
TACAAGGAGA TAAAATTAGT 
ACTCCTTATG ATAGTGTACT 
TGCATATTAT TATAATCCTG 
ATAAAATTCC AAGTATAATT 
ATTGGTCATG GTAAAGATGA 
TGTAGATTCA TTATCCACAG 
AGGATATTTC TCCTAAGTCA 
TTTAGCTACT CTATCAACGT 
TAAAGTTAAA GATAAAATAT 
CTATTATAGT AAGTGCAAAT 
AGAAGAGAAT TATTGGATCA 
TATTATAAAG GATATTTCAT 
AAAATAAAAT TACAGTAAAA 
TTACAAGAAA TTAGAAATAA 
AAAAGTAATG TTAACAGAAT 
CGCAAATTGT TGAGGAAAGG 
TCTATTAATT ATATAAAAGA 
TGCACTATGT GACTTAAAAC 
TATCTTTTGA GGACATATCA 
ATTAATAAAG AAACTGGAGA 
ATTCTCTGAA TATGCTAATC 
GTACTATATT TGATACTGTA 
GATACTACAC ACGAAGTAAA 
ATTAATAGAA TATAATAGTT 
CAATGAAAGT CCAAGTTTAC 
ATTACAGATG CAGCCAAAGT 
AACTATAGAC TTACTTCCTA 
CTATTATAGA TGGTGTAAGT 
ACGAGTGACC CATTATTAAG 
GGCAGTAAAT TTAACAACAG 
GGATAGCTAG TGGATTTAGT 
GCAGGTATAC CAAGCTTAGT 
AACAAAGGTT GTAGATTATT 
GAGTATTTAC TTTATTAGAT 



GACATATCTT TCAAATGAGT 
ATATTGAAGA ATCCTTAAAT 
AGAAACTTTG AAGAATTTAA 
AGAGTTGGTA GAAAGGTGGA 
TATCTGCATT AAAAGAAATT 
CCAGGAATAC AACCAGACTT 
AACAGTGGAT TTTTGGGAAA 
AAGAATATAT ACCAGAATAT 
GAAGTTCAAA GTAGTTTTGA 
AGAAATATTC TCATCACTTG 
AAATTGCATT TAATAGTAAG 
AAAGACTCAT ATTGTAGCAA 
TAAAATATTG AATAATAGTT 
TTAATACTAC, AACGAATACC 
gcagataatg' GTAGATTTAT 
TTTCTTCCCA GATGTTAAAA 
ATGCGGCAGC TTATCAAGAT 
ATCCATTTGA TAGAAGCTGA 
TATTTCTCAA TCAACTGAAC 
ATGCAAGAGC TAAAGCTCAA 
GGTTCTCTTG GTGAAGATGA 
TGACAAGGAG TATCTTTTAG 
AGAGAGGATA TATACACTAT 
TATGAAGCAG CATGTAACTT 
GTTTCAGAAA AATATAGAAG 
GAGATGGTGA AATACAAGAA 
TCTGATAGAC CTAAGATTAA 
ATTTAATACT GATATATTTG 
AAATAGAAGC AGCAATAGAT 
ATAGAAATAA ATTTATTAGG 
AGAGGAGACT TATCCTGGAA 
CAGAATTAAT GCCATCTATA 
CAATATGAAG TTAGAATAAA 
TTCTGGTGAA TGGATAAATA 
CAAAAGAATA TATATCATTT 
TCTAAAAATT TACCTGAGCT 
TTCTAATTCA AGTGATATTG 
GTGAGATAAA TGTTATTTCA 
ATTGAAGAAG CTAAGAATTT 
TGAATTTAAA CTAATAGAAT 
AACAGAATGA ATTAGAAGAT 
GAGACTGATG AGGGATTTAG 
ATCTATATTT GTAGAAACTG 
ATATAACTGA AGAGATTTCT 
AATGGTAAGT TAGTAAAAAA 
TACTTTAAAT GCTGCATTTT 
CTAAAGAATC TCTTAGTAAT 
GCTCAATTAT TTAGTACTGG 
TGTTGAATTA GTATCAACTG 
CATTATCTGA AGGATTACCT 
TTAGGTGCAG CAATCAAAGA 
ACAAGAAATA GAAGCTAAGA 
CTACAACTGC AATCATTACT 
ATACTTTTAG TTCCTTTAGC 
AAACAATGAA CTTGTACTTC 
TTAAACATGT TTCATTAGTT 
GATAAAATAA TGATGCCAGA 
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ATTCAAAGGA 


650 


AAAATTACAC 


700 


AAATGGAGAG 


750 


ATTTAGCTGC 


800 


GGTGGTATGT 


850 


ATTTGAGTCT 


900 


TGACAAAGTT 


950 


ACCTCAGAAC 


1000 


ATCTGTTCTA 


1050 


GTGATATGGA 


1100 


GGTATTATAA 


1150 


TTTAATAGTA 


1200 


TAAATCCAGC 


1250 


TTTATTGATA 


1300 


GATGGAACTA 


1350 


CTACTATTAA 


1400 


TTATTAATGT 


1450 


TTTAAGAAAC 


1500 


AAGAAATGGC 


1550 


TTTGAAGAAT 


1600 


TAATCTTGAT 


1650 


AAAAAATATC 


1700 


ATTGTTCAGT 


1750 


ATTTGCAAAG 


1800 


ATTCAGAAAT 


1850 


ATAGACAAGT 


1900 


ATTAACATTT 


1950 


CAGGTTTTGA 


2000 


TTAGCTAAAG 


2050 


ATGTAATATG 


2100 


AATTATTACT 


2150 


AGTCAAGACT 


2200 


TAGTGAAGGA 


2250 


AAGAAGAAAG 


2300 


AATCCTAAAG 


2350 


ATCTACATTA 


2400 


AACTAGAAGA 


2450 


AATATAGATA 


2500 


AACTTCTGAC 


2550 


CTATTTCTGA 


2600 


TCTCATTTTA 


2650 


TATAAGATTT 


2700 


AAAAAACAAT 


2750 


AAGATAAAAG 


2800 


AGTAAATTTA 


2850 


TTATACAATC 


2900 


TTAAGTGTAG 


2950 


TTTAAATACT 


3000 


CATTAGATGA 


3050 


ATAATTGCAA 


3100 


GCTAAGTGAA 


3150 


TAGGTATAAT 


3200 


TCATCTTTGG 


3250 


AGGAATTTCA 


3300 


GAGATAAGGC 


3350 


GAAACTGAAG 


3400 


AGATGATTTA 


3450 
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GTGATATCAG AAATAGATTT TAATAATAAT TCAATAGTTT TAGGTAAATG 3500 

TGAAATCTGG AGAATGGAAG GTGGTTCAGG TCATACTGTA ACTGATGATA 3550 

TAGATCACTT CTTTTCAGCA CCATCAATAA CATATAGAGA GCCACACTTA 3 600 

TCTATATATG ACGTATTGGA AGTACAAAAA GAAGAACTTG ATTTGTCAAA 3650 

AGATTTAATG GTATTACCTA ATGCTCCAAA TAGAGTATTT GCTTGGGAAA 3700 

CAGGATGGAC ACCAGGTTTA AGAAGCTTAG AAAATGATGG CACAAAACTG 3750 

TTAGACCGTA TAAGAGATAA CTATGAAGGT GAGTTTTATT GGAGATATTT 3800 

TGCTTTTATA GCTGATGCTT TAATAACAAC ATTAAAACCA AGATATGAAG 3850 

ATACTAATAT AAGAATAAAT TTAGATAGTA ATACTAGAAG TTTTATAGTT 3900 

CCAATAATAA CTACAGAATA TATAAGAGAA AAATTATCAT ATTCTTTCTA 3950 

TGGTTCAGGA GGAACTTATG CATTGTCTCT TTCTCAATAT AATATGGGTA 4000 

TAAATATAGA ATTAAGTGAA AGTGATGTTT GGATTATAGA TGTTGATAAT 4050 

GTTGTGAGAG ATGTAACTAT AGAATCTGAT AAAATTAAAA AAGGTGATTT 4100 

AATAGAAGGT ATTTTATCTA CACTAAGTAT TGAAGAGAAT AAAATTATCT 4150 

TAAATAGCCA TGAGATTAAT TTTTCTGGTG AGGTAAATGG AAGTAATGGA 4200 

TTTGTTTCTT TAACATTTTC AATTTTAGAA GGAATAAATG CAATTATAGA 4250 

AGTTGATTTA TTATCTAAAT CATATAAATT ACTTATTTCT GGCGAATTAA 4300 

AAATATTGAT GTTAAATTCA AATCATATTC AACAGAAAAT AGATTATATA 43 50 

GGATTCAATA GCGAATTACA GAAAAATATA CCATATAGCT TTGTAGATAG 44 00 

TGAAGGAAAA GAGAATGGTT TTATTAATGG TTCAACAAAA GAAGGTTTAT 4450 

TTGTATCTGA ATTACCTGAT GTAGTTCTTA TAAGTAAGGT TTATATGGAT 4500 

GATAGTAAGC CTTCATTTGG ATATTATAGT AATAATTTGA AAGATGTCAA 4550 

AGTTATAACT AAAGATAATG TTAATATATT AACAGGTTAT TATCTTAAGG 4600 

ATGATATAAA AATCTCTCTT TCTTTGACTC TACAAGATGA AAAAACTATA 4650 

AAGTTAAATA GTGTGCATTT AGATGAAAGT GGAGTAGCTG AGATTTTGAA 4700 

GTTCATGAAT AGAAAAGGTA ATACAAATAC TTCAGATTCT TTAATGAGCT 4750 

TTTTAGAAAG TATGAATATA AAAAGTATTT TCGTTAATTT CTTACAATCT 4800 

AATATTAAGT TTATATTAGA TGCTAATTTT ATAATAAGTG GTACTACTTC 4 850 

TATTGGCCAA TTTGAGTTTA TTTGTGATGA AAATGATAAT ATACAACCAT 4900 

ATTTCATTAA GTTTAATACA CTAGAAACTA ATTATACTTT ATATGTAGGA 4 950 

AATAGACAAA ATATGATAGT GGAACCAAAT TATGATTTAG ATGATTCTGG 5000 

AGATATATCT TCAACTGTTA TCAATTTCTC TCAAAAGTAT CTTTATGGAA 5050 

TAGACAGTTG TGTTAATAAA GTTGTAATTT CACCAAATAT TTATACAGAT 5100 

GAAATAAATA TAACGCCTGT ATATGAAACA AATAATACTT ATCCAGAAGT 5150 

TATTGTATTA GATGCAAATT ATATAAATGA AAAAATAAAT GTTAATATCA 5200 

ATGATCTATC TATACGATAT GTATGGAGTA ATGATGGTAA TGATTTTATT 5250 

CTTATGTCAA CTAGTGAAGA AAATAAGGTG TCACAAGTTA AAATAAGATT 53 00 

CGTTAATGTT TTTAAAGATA AGACTTTGGC AAATAAGCTA TCTTTTAACT 5350 

TTAGTGATAA ACAAGATGTA CCTGTAAGTG AAATAATCTT ATCATTTACA 54 00 

CCTTCATATT ATGAGGATGG ATTGATTGGC TATGATTTGG GTCTAGTTTC 5450 

TTTATATAAT GAGAAATTTT ATATTAATAA CTTTGGAATG ATGGTATCTG 5500 

GATTAATATA TATTAATGAT TCATTATATT ATTTTAAACC ACCAGTAAAT 5550 

AATTTGATAA CTGGATTTGT GACTGTAGGC GATGATAAAT ACTACTTTAA 560 0 

TCCAATTAAT GGTGGAGCTG CTTCAATTGG AGAGACAATA ATTGATGACA 5650 

AAAATTATTA TTTCAACCAA AGTGGAGTGT TACAAACAGG TGTATTTAGT 5700 

ACAGAAGATG GATTTAAATA TTTTGCCCCA GCTAATACAC TTGATGAAAA 5750 

CCTAGAAGGA GAAGCAATTG ATTTTACTGG AAAATTAATT ATTGACGAAA 5800 

ATATTTATTA TTTTGATGAT AATTATAGAG GAGCTGTAGA ATGGAAAGAA 5850 

TTAGATGGTG AAATGCACTA TTTTAGCCCA GAAACAGGTA AAGCTTTTAA 5900 

AGGTCTAAAT CAAATAGGTG ATTATAAATA CTATTTCAAT TCTGATGGAG 5950 

TTATGCAAAA AGGATTTGTT AGTATAAATG ATAATAAACA CTATTTTGAT 6000 

GATTCTGGTG TTATGAAAGT AGGTTACACT GAAATAGATG GCAAGCATTT 6050 

CTACTTTGCT GAAAACGGAG AAATGCAAAT AGGAGTATTT AATACAGAAG 6100 

ATGGATTTAA ATATTTTGCT CATCATAATG AAGATTTAGG AAATGAAGAA 6150 

GGTGAAGAAA TCTCATATTC TGGTATATTA AATTTCAATA ATAAAATTTA 6200 

CTATTTTGAT GATTCATTTA CAGCTGTAGT TGGATGGAAA GATTTAGAGG 6250 

ATGGTTCAAA GTATTATTTT GATGAAGATA CAGCAGAAGC ATATATAGGT 6300 
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TTGTCATTAA TAAATGATGG TCAATATTAT 
GCAAGTTGGA TTTGTCACTA TAAATGATAA 
- CTGGAATTAT AGAATCTGGA GTACAAAACA 
ATAGATGATA ATGGTATAGT TCAAATTGGT 
ATATAAATAT TTTGCACCTG CTAATACTGT 
AAGCAGTTGA ATATAGTGGT TTAGTTAGAG 
TTTGGAGAAA CATATACAAT TGAGACTGGA 
TGAAAGTGAT AAATATTATT TCAATCCAGA 
GTATTAATTT AATTGATGAT ATAAAATATT 
ATGAGAACGG GTCTTATATC ATTTGAAAAT 
GAATGGTGAA ATGCAATTTG GTTATATAAA 
ATTTTGGTGA AGATGGTGTC ATGCAGATTG 
GGATTTAAAT ACTTTGCACA TCAAAATACT 
AGAATCAATA AACTATACTG GTTGGTTAGA 
ATTTTACAGA TGAATATATT GCAGCAACTG 
GAGGAGTATT ATTTTGATCC TGATACAGCT 
G 
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TTTAATGATG ATGGAATTAT 63 50 

AGTCTTCTAC TTCTCTGACT 64 00 

TAGATGACAA TTATTTCTAT 64 50 

GTATTTGATA CTTCAGATGG 6500 

AAATGATAAT ATTTACGGAC 6550 

TTGGGGAAGA TGTATATTAT 6600 

TGGATATATG ATATGGAAAA 6650 

AACTAAAAAA GCATGCAAAG 6700 

ATTTTGATGA GAAGGGCATA 6750 

AATAATTATT ACTTTAATGA 6800 

TATAGAAGAT AAGATGTTCT 6850 

GAGTATTTAA TACACCAGAT 6900 

TTGGATGAGA ATTTTGAGGG 6950 

TTTAGATGAA AAGAGATATT 7000 

GTTCAGTTAT TATTGATGGT 7050 

CAATTAGTGA TTAGTGAATA 7100 

7101 



2) INFORMATION FOR SEQ ID NO: 2131 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2131 
AAGCMATTGT TGTAAATTTT GAAAG 25 



2) INFORMATION FOR SEQ ID NO: 2132 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2132 
TCATATCCAT AGCAATAGTT CTA 23 



2) INFORMATION FOR SEQ ID NO: 2133 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2133 
CCACGCACAW AAACTTGTTT TAGAAGTAGC AGCWCAGCGT GG 42 



2) INFORMATION FOR SEQ ID NO: 2134 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2134 
CGCGTGAAGC TTCTGTG 17 



2) INFORMATION FOR SEQ ID NO: 2135 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2135 
TCTCGCCGTT ATTCAGTTTC 20 



2) INFORMATION FOR SEQ ID NO: 2136 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2136 
CCAACGCGTC CTCAATCATT TCTAACTTCT ATGGCCGGCG TTGG 44 
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2) INFORMATION FOR SEQ ID NO: 2137 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 889 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas putida 

(C) ACCESSION NUMBER: Genome project 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2137 

ATGATCACCG GTGCTGCCCA GATGGACGGC GCGATCCTGG TTTGCTCGGC 50 

CGCCGATGGT CCGATGCCrC AAACCCGTGA GCACATCCTG CTGTCCCGTC 100 

AGGTAGGCGT TCCGTACATC GTGGTCTTCC TGAACAAGGC CGACCTGGTA 150 

GACGACGCTG AGCTGCTGGA ACTGGTCGAG ATGGAAGTTC GCGACCTGCT 200 

GTCCACCTAC GACTTCCCAG GCGACGACAC TCCGATCATC ATCGGTTCCG 2 50 

CTCGTATGGC CCTGGAAGGC AAAGACGACA ACGAAATGGG CACTACCGCT 300 

GTCAAGAAGC TGGTAGAAAC TCTGGATGCC TACATCCCTG AGCCAGTTCG 3 50 

TGCCATCGAC CAGCCGTTCC TGATGCCGAT CGAAGACGTG TTCTCGATCT 400 

CGGGTCGTGG TACCGTTGTT ACCGGTCGTA TCGAGCGTGG CATCGTTCGC 4 50 

GTTCAGGATC CGCTGGAAAT CGTTGGTCTG CGTGACACCA CCACCACCAC 500 

CTGCACCGGT GTTGAGATGT TCCGCAAGCT GCTGGACGAA GGTCGTGCTG 550 

GCGAGAACTG CGGCGTTCTG CTGCGTGGTA CCAAGCGTGA CGACGTTGAG 600 

CGTGGCCAGG TTCTGGTCAA GCCAGGTTCG GTCAAGCCGC ACACCAAGTT 650 

CACCGCAGAA GTCTACGTCC TGTCGAAGGA AGAAGGCGGC CGTCACACTC 700 

CGTTCTTCAA AGGCTACCGT CCACAGTTCT ACTTCCGTAC CACTGACGTG 750 

ACCGGTAACT GCGAACTGCC GGAAGGCGTT GAAATGGTAA TGCCAGGTGA 800 

CAACATTCAG ATGACTGTTA CCCTGATCAA GACCATCGCA ATGGAAGACG 850 

GTCTGCGCTT CGCTATCCGT GAAGGCGGTC GTACCGTCG 889 



2) INFORMATION FOR SEQ ID NO: 2138 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: PAO-1 

(C) ACCESSION NUMBER: Genome project 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2138 
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AACATGATCA 
GGCTGCCGAC 
GCCAGGTAGG 
GTCGACGACG 
GCTGAACACC 
CCGCGCTGAT 
GCCGTGCAGA 
TCGTGCCATC 
TCTCCGGCCG 
AAGGTCCAGG 
TACCTGCACC 
CTGGTGAGAA 
GAGCGTGGCC 
GTTCGAGTGC 
CCCCGTTCTT 
GTGACCGGTA 
CGACAACATC 
ATGGCCTGCG 



CCGGTGCTGC 
GGCCCCATGC 
CGTTCCCTAC 
CCGAGCTGCT 
TACGACTTCC 
GGCGCTGGAA 
AGCTGGTAGA 
GACCAGCCGT 
CGGTACCGTG 
AAGAAGTGGA 
GGCGTTGAAA 
CGTTGGTATC 
AGGTTCTGGC 
GAAGTGTACG 
CAAGGGCTAC 
ACTGCGAACT 
AAGATGGTTG, 
CTTCGCGATC 



CCAGATGGAC 
CGCAGACCCG 
ATCGTCGTGT 
GGAACTGGTC 
CGGGCGACGA 
GGCAAGGATG 
GACCCTGGAC 
TCCTGATGCC 
GTAACCGGTC 
AATCGTCGGC 
TGTTCCGCAA 
CTGCTGCGTG 
CAAGCCGGGC 
TGCTGTCCAA 
CGTCCGCAGT 
GCCGGAAGGC 
TCACCCTGAT 
CGCGAAGGCG 



GGCGCGATCC 
CGAGCACATC 
TCCTGAACAA 
GAGATGGAAG 
CACTCCGATC 
ACAACGGCAT 
TCCTACATTC 
GATCGAAGAC 
GTGTAGAGCG 
ATCAAGGCGA 
GCTGCTCGAC 
GCACCAAGCG 
ACCATCAAGC 
GGAAGAAGGT 
TCTACTTCCG 
GTAGAGATGG 
CGCTCCGATC 
GCCGTACCGT 



TGGTTTGCTC 
CTGCTGTCCC 
GGCCGACATG 
TTCGCGATCT 
ATCATCGGTT 
CGGCGTAAGC 
CGGAGCCGGT 
GTGTTCTCGA 
CGGCATCATC 
CCACCAAGAC 
GAAGGTCGTG 
TGAAGACGTA 
CGCACACCAA 
GGTCGTCACA 
TACCACCGAC 
TAATGCCGGG 
GCCATGGAAG 
TGGCGCCGGC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 



2) INFORMATION FOR SEQ ID NO: 2139 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1398 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni 

(B) STRAIN: NCTC 11168 

(C) ACCESSION NUMBER: AL139074 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2139 

ATGCAAGGAT TTATTTCACA GGTATTAGGT CCGGTTGTTG ATGTAGATTT 50 

TAACGACTAT TTGCCTCAAA TTAATGAAGC AATTGTTGTA AATTTTGAAA 100 

GCGAAGGAAA AAAACATAAA CTTGTTTTAG AAGTAGCAGC TCATTTAGGA 150 

GATAATAGAG TTAGAACTAT TGCTATGGAT ATGACAGATG GTTTGGTAAG 200 

GGGCTTAAAA GCTGAGGCTT TAGGTGCTCC CATTAGTGTT CCTGTTGGTG 250 

AGAAAGTTTT AGGAAGAATT TTCAATGTTA CTGGAGATTT GATCGATGAA 300 

GGTGAAGAAA TTTCTTTCGA TAAAAAATGG GCAATTCATA GAGATCCACC 350 

AGCTTTTGAA GATCAAAGCA CAAAAAGTGA GATTTTTGAA ACAGGGATTA 400 

AAGTTGTAGA TTTACTTGCT CCTTATGCAA AAGGTGGTAA AGTAGGTCTT 450 

TTTGGTGGTG CAGGTGTTGG TAAAACTGTT ATTATTATGG AGCTTATTCA 500 

CAATGTTGCA TTTAAGCATA GCGGCTATTC TGTATTTGCA GGTGTGGGTG 550 

AGAGAACTCG TGAAGGAAAT GACCTTTATA ATGAAATGAA AGAAAGTAAT 600 

GTTTTAGACA AAGTTGCTCT ATGTTATGGA CAAATGAATG AACCACCAGG 650 

GGCAAGAAAT CGTATTGCTT TAACAGGTTT AACAATGGCT GAGTATTTTA 700 

GAGATGAAAT GGGTCTTGAT GTGCTTATGT TTATTGATAA TATCTTTAGA 750 

TTTTCACAAT CAGGTTCTGA AATGTCAGCA CTTTTAGGAA GAATTCCATC 800 

AGCTGTGGGT TATCAACCAA CCTTAGCAAG TGAAATGGGT AAATTCCAAG 850 

AAAGAATTAC TTCAACTAAA AAAGGCTCAA TCACTTCAGT TCAAGCTGTT 900 

TACGTTCCAG CTGATGACTT AACAGACCCA GCTCCAGCAA CTGTTTTTGC 950 
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TCACTTAGAT GCTACAACGG TTTTAAATAG 
TTTATCCTGC AGTTGACCCA CTTGACTCAA 
AATATCATTG GAGAAGAACA TTATAAAGTT 
GCTTCAAAAA TACAAAGATT TACAAGATAT 
ATGAGCTTAG CGAAGAGGAT AAACTTGTAG 
GAAAAATTCT TATCACAACC ATTTTTCGTT 
CCCAGGAAAA TATATAAGCC TTGAAGATAC 
TTTTAGAAGG TAAATATGAT CATTTACCAG 
GGAAATATAG ATGAAGCTAT TGCAAAAGCG 



AGCTATTGCT GAAAAGGGTA 1000 

CTTCAAGAAT GCTTGACCCA 1050 

GCTCGTGGTG TTCAATCAGT 1100 

CATCGCCATT TTAGGTATGG 1150 

TTGAAAGAGC AAGAAAGATT 1200 

GCAGAAGTTT TCACAGGTAG 1250 

AATAGCAGGA TTTAAGGGAA 1300 

AAAATGCTTT CTATATGGTT 13 50 

GATAAATTAA AAGGTTAA 13 98 



2) INFORMATION FOR SEQ ID NO: 2140 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 2157 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 670 

(C) ACCESSION NUMBER: X67867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2140 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTACGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACTTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 

TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 250 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 3 00 

ATTGATTCCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGTAA 3 50 

TTCCCTCCAA GGTGGATCAA CTCTCACCCA ACAGTTGATT AAGTTGACTT 4 00 

ACTTTTCAAC CTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 4 50 

GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAAC AGGAAATCTT 500 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGCAAC TATGGAATGC 550 

AGACAGCAGC TCAAAACTAC TATGGTAAAG ACCTCAATAA TTTAAGTTTA 600 

CCTCAGTTAG CCTTGCTGGC TGGAATGCCT CAGGCACCAA ACCAATATGA 650 

CCCCTATTCA CATCCAGAAG CAGCCCAAGA CCGCCGAAAC TTGGTCTTAT 700 

CTGAAATGAA AAATCAAGGT TACATCTCTG CTGAACAGTA TGAGAAAGCA 750 

GTCAATACAC CAATTACTGA TGGACTACAA AGTCTCAAAT CAGCAAGTAA 800 

TTACCCTGCT TACATGGATA ATTACCTCAA GGAGGTCATC AATCAAGTAG 850 

AACAAGAAAC TGGCTATAAC CTTCTAACTA CTGGGATGGA TGTTTACACA 900 

AATGTAGACC AAGAAGCTCA AAAACATCTG TGGGATATCT ACAACTCCGA 950 

TCAATACGTC TCTTACCCTG ACGATGATTT GCAAGTCGCA TCTACGGTCG 1000 

TAGATGTTTC AAATGGTAAA GTCATCGCCC AACTTGGAGC TCGTCACCAA 1050 

GCAAGTAACG TTTCATTTGG TACCAACCAA GCTGTGGAAA CCAATCGTGA 1100 

CTGGGGTTCT GCTATGAAAC CAATCACCGA TTATGCACCT GCCATAGAAT 1150 

ACGGTGTTTA TGATTCCACT GCAACTATGG TTAATGATAT TCCTTATAAC 1200 

TATCCGGGAA CAAGCACACC TGTCTACAAC TGGGATAGAG CATATTTCGG 12 50 

TAATATTACT CTGCAATATG CTCTTCAACA ATCACGAAAT GTCACAGCCG 1300 

TTGAGACTTT GAATAAGGTC GGTCTAGATA GAGCTAAAAC CTTCCTTAAT 1350 

GGTCTTGGTA TCGACTATCC AAGCATGCAT TATGCAAACG CCATTTCAAG 14 00 
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TAATACAACA GAATCTAATA AACAATACGG 
CTGCTGCTTA TGCTGCCTTT GCCAACGGTG 
TATATCAATA AGGTCGTCTT CAGTGACGGT 
TGTAGGTACA CGAGCTATGA AAGAAACAAC 
TGATGAAAAC TGTCTTGGCA TACGGAACTG 
TGGTTAGCGC AAGCTGGTAA GACAGGTACT 
AATTGAAAAA CACATCAAGA ACACTGGCTA 
TTGTTGGTTA TACTCGTAAG TATTCTATGG 
AATCGTTTAA CTCCTATCGT TGGAGATGGT 
TTATCGCTCA ATGATAACGT ATCTATCAGA 
GGACGATGCC AGACGGACTT TTCAGAAACG 
GGAGCTCGTT CTACGTGGAG CTCACCTGCT 
TGAAAGTTCA AGCTCATCAT CAGATAGTTC 
CCACTCCAAG CACAAATAAT AGTACGACTA 
CAACAATCAA ATACAACCCC TGATCAACAA 
ACAACCA 
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AGCAAGTAGT GAAAAAATGG 14 50 

GAATTTACCA CAAACCCATG 1500 

AGTAAAAAAG AATTTTCAGA 1550 

TGCTTACATG ATGACCGAAA 1600 

GTCGTGGAGC CTATCTCCCA 1650 

TCTAACTACA CAGATGATGA 1700 

TGTAGCTCCA GATGAAATGT 1750 

CTGTATGGAC AGGTTATTCG 1800 

TTCCTAGTTG CAGCTAAAGT 1850 

AGATACTCAT CCAGAAGACT 1900 

GGGAATTTGT ATTCAAAAAT 1950 

CCACAACAAC CCCCATCAAC 2000 

AACTTCACAG TCTAGCTCAA 2050 

CCGATCCTAA CAATAATACG 2100 

AATCAGAATC CTCAACCAGC 2150 

2157 



2) INFORMATION FOR SEQ ID NO: 2141 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRAN DEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2141 
CACCAGTCAC AGAAAAGC 18 



2) INFORMATION FOR SEQ ID NO: 2142 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2142 
GCCTTAATTT CGGATAGTGC 20 



2) INFORMATION FOR SEQ ID NO: 2143 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2143 
GAGAAAGAGC CCAATTATCT AATGT 



2) INFORMATION FOR SEQ ID NO: 2144 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3075 bases 

(B) TYPE: • Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: J2870 

(C) ACCESSION NUMBER: X75439 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2144 

TTGACAAAGA AATATTTAAA CACCCAGAAT GAAATATCAG CATTTTGGAA 50 

TACTCAAAAG ATATTTAAAA AATCAATTGA CAATAGAAAA GGACAGGAAA 100 

GTTTTGTTTT TTATGACGGC CCCCCAACTG CAAATGGCCT TCCTCATGCT 150 

GGCCATGTTC TTGGAAGAGT AATCAAGGAT TTAGTTGCAA GATTAAAAAC 200 

TATGCAAGGT TTTTATGTAG AAAGAAAAGC AGGATGGGAT ACCCATGGCT 250 

TACCAGTTGA ATTAGAGGTT GAAAAAAAAA TTGGAATTAA AGGAAAACAA 300 

GACATTGAAA AGTATGGAAT AGAAAATTTT ATAAATGAAT GTAAAAAAAG 350 

TGTATTTAAT TATGAAAAAG AATGGCGGGA TTTTTCTAAA GATTTAGGAT 4 00 

ACTGGGTTGA CATGGACTCC CCCTATATAA CTCTTGAGAA TAATTATATT 450 

GAAAGTGTAT GGAATATATT ATCTACATTC CATAAAAAAG GACTATTATA 500 

TAAGGGACAT AAGGTGACTC CTTATTGTAC ACATGATCAA ACCGCTTTAA 550 

GTTCTCATGA AGTAGCGCAA GGCTATAAAA ACGTTAAAGA TTTATCAGCT 600 

GTTGTTAAAT TTCAACTTAC AAATAGTAAA GATACTTATT TCTTAAGTTG 650 

GACTACCACT CCCTGGACTT TGCCTGCAAA TGTAGCATTA GCTATAAATA 700 

AAGATCTTAA TTATTCAAAA ATTCGGGTAG AAAATGAGTA TTATATCTTA 750 

GCTACAGATC TAATTAATTC TATAATAACT GAAAAATACG AAATTATTGA 800 

TACCTTTTCA GGAAGTAATT TAATTAATTT AAAATACATT CCTCCTTTTG 850 

AAAGCGA CGG TTTAGTTAAT GCATATTACG TTGTTGATGG AGAATTTGTT 900 

ACTAACTCAG AAGGAACTGG TATTGTTCAT ATAGCACCAG CTCATGGGGA 950 

AGATGACTAC CAATTGGTTT TAGAGCGTGA TTTGGATTTC TTAAATGTTA 1000 

TAACAAGAGA AGGAGTATAT AATGATAGGT TCCCTGAATT AGTTGGTAAT 1050 

AAAGCTAAAA ATAGTGATAT AGAAATCATA AAATTATTAT CCAAAAAACA 1100 

ACTTTTATAT AAAAAACAAA AATATGAGCA TAATTATCCT CATTGTTGGA 1150 

GATGTGGTAA TCCTTTGATA TATTATGCGA TGGAAGGTTG GTTTATTAAA 1200 

ACAACTAATT TTAAGAATGA AATTATTAAC AATAATAATA ATATAGAGTG 1250 

GTTTCCTTCT CATATTAAGG AAGGGAGAAT GGGAAATTTC TTAGAAAATA 1300 

TGGTTGATTG GAACATTGGT AGAAATAGAT ATTGGGGAAC ACCATTAAAT 13 50 

GTATGGATTT GCAATGATTG TAATCACGAA TACGCACCAA GTAGTATTAA 14 00 

GGATTTACAA AATAATTCCA TCAATAAAAT TGATGAAGAT ATTGAGTTGC 14 50 

ATAGACCTTA TGTTGATAAT ATCACTCTTA GTTGCCCTAA GTGTAATGGG 1500 
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AAAATGTCTC GAGTAGAAGA 
TATGCCGTTT GCTCAGCATC 
ATCAACACTT TCCAGCTGAT 
GGCTGGTTTT ACAGTTTACT 
TTCTTATAAA CGTGCTTTAT 
AAAAAATGTC TAAAAGTAAA 
AATAAGTACG GAGCCGATTC 
TCCATGGAAT AACAAAAGAT 
CGAAATTTAT AGATACGCTT 
AATAAAATAG ATCACTATAA 
TACATTAGAT AATTGGGCTC 
GTAATATTTA TGTAAATAAT 
AACGAATATA CCAATACAAT 
ACGATTTTGG GAACAAGGAA 
CGCTTTATGA AATTTTAACA 
CCATTTATAT CTGAAAAAAT 
TTTACAAGAT TATCCACAAT 
AAGATGAAAT GCATACCGTT 
CGCAAAAATG CAGATTTAAA 
TAAACCTAAT AGTCAATTAA 
TAATAAAAGA CGAATTAAAT 
AATGACTATA TTACCTATGA 
AAAACTAGGG AACAAAACGA 
CAGAGTATGA TAAAAAAAGT 
TCTTCTGATG CTGAGTTAAC 
TAAGGATAGT TATCAACTCA 
ATAAAAATTT ATCTCCTGAA 
ATTAGATTAA TTCAACAATT 
ACGTATTGAT ATTTATATCG 
AAACCAATAA AAATATGTTT 
TTAAATGTTA TAGATGAATA 
AATAAAAATT TCCTTATTAT 



AGTAATCGAT GTTTGGTTTG 
ATTATCCTTT TGATAACCAG 
TTTATTGCAG AAGGAGTTGA 
AGTAATTTCT ACTATTCTAA 
CTTTAGGACA TATTCTAGAC 
GGAAACGTTA TTAATCCAAC 
TTTAAGATGG GCCTTAATTT 
TCTCAGAAAA TATAGTAGCT 
GATAATATTT ATAAATTTTA 
TCCTAATAAT GAAATTACAA 
TTTCTCGCTT AAACACCTTA 
TACGATTTCA CTTCCGCAGC 
AAGTAATTGG TATATCGGAG 
TTTCTAACGA TAAAAAAGAT 
ACTTTATCAA GACTAGTGGC 
CCATTATAAT TTGACTGGAA 
ATAAAGAAAG TTTTATTAAT 
ATAAAAATTG TAGAATTATC 
AATTAAGCAA CCTTTATCGA 
ACTTAAGTTT TTTACCTAAT 
ATAAAAAACA TTGAATTAAC 
GCTTAAATTG AATTTTTCTT 
AAAATATTCA AACATTGATA 
TTAATTGAGT CTAATAACTT 
TAAGGATGAT TTTATAATTA 
GTGAAGATAA TGACTGCGTT 
TTAATTCGCG AAGGACATGC 
AAGAAAAAAG AAAAATTTAC 
GTGTAACTGG GGAATTATTA 
AAAGAAAATT TCGTGATTAA 
TGAAAATACT ATTCATTTTA 
ATTAA 



ATAGCGGCTC 


1550 


AAAATTTTTA 


1600 


TCAAACGAGA 


1650 


AAGGAAAATC 


1700 


AGTAATGGTA 


1750 


TGAATTAATT 


1800 


CGGATAGTGC 


1850 


CAGACCAAAT 


1900 


TAATATGTAT 


1950 


AAAGTAGAAA 


2000 


ATAAAAGAAA 


2050 


CAGATTAATT 


2100 


ATTCGAGAGG 


2150 


GCGTACAATA 


2200 


TCCATTTGTT 


2250 


AAAGTGTGCA 


2300 


CAAGCATTGG 


2350 


TAGACAGGCT 


2400 


AAATGGTGAT 


2450 


TACTATTCAA 


2500 


TGATAATATT 


2550 


CTGTGGGACC 


2600 


GACTCCCTAT 


2650 


CAAAAGTTTA 


2700 


AAACCTTACC 


2750 


ATATTATTAG 


2800 


TAGAGAGCTC 


2850 


CAATAAATCA 


2900 


GAATCAATAA 


2950 


AAATATACAC 


3000 


ATAATAAAGA 


3050 




3075 



2) INFORMATION FOR SEQ ID NO: 2145 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2145 
GCAAGATGTG GCGTGTTACG GT 



2) INFORMATION FOR SEQ ID NO: 2146 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2146 
GGGGCGAAGA AGTTGTCCAT ATT 23 



2) INFORMATION FOR SEQ ID NO: 2147 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: V00622 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2147 

ATGGAGAAAA AAATCACTGG ATATACCACC GTTGATATAT CCCAATGGCA 50 

TCGTAAAGAA CATTTTGAGG CATTTCAGTC AGTTGCTCAA TGTACCTATA 100 

ACCAGACCGT TCAGCTGGAT ATTACGGCCT TTTTAAAGAC CGTAAAGAAA 150 

AATAAGCACA AGTTTTATCC GGCCTTTATT CACATTCTTG CCCGCCTGAT 200 

GAATGCTCAT CCGGAATTCC GTATGGCAAT GAAAGACGGT GAGCTGGTGA 2 50 

TATGGGATAG TGTTCACCCT TGTTACACCG TTTTCCATGA GCAAACTGAA 300 

ACGTTTTCAT CGCTCTGGAG TGAATACCAC GACGATTTCC GGCAGTTTCT 350 

ACACATATAT TCGCAAGATG TGGCGTGTTA CGGTGAAAAC CTGGCCTATT 4 00 

TCCCTAAAGG GTTTATTGAG AATATGTTTT TCGTCTCAGC CAATCCCTGG 4 50 

GTGAGTTTCA CCAGTTTTGA TTTAAACGTG GCCAATATGG ACAACTTCTT 500 

CGCCCCCGTT TTCACCATGG GCAAATATTA TACGCAAGGC GACAAGGTGC 550 

TGATGCCGCT GGCGATTCAG GTTCATCATG CCGTCTGTGA TGGCTTCCAT 6 00 

GTCGGCAGAA TGCTTAATGA ATTACAACAG TACTGCGATG AGTGGCAGGG 6 50 

CGGGGCGTAA 6 60 



2) INFORMATION FOR SEQ ID NO: 2148 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2148 
CAGATTAAAT GCGGATTCAG CC 22 
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2) INFORMATION FOR SEQ ID NO: 2149 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION:' SEQ ID NO: 2149 
ATCAGGTAAA TCATCAGCGG ATA 



2) INFORMATION FOR SEQ ID NO: 2150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: K12 

(C) ACCESSION NUMBER: X53796 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2150 

ATGAATTTTA CCCGGATTGA CCTGAATACC TGGAATCGCA GGGAACACTT 50 

TGCCCTTTAT CGTCAGCAGA TTAAATGCGG ATTCAGCCTG ACCACCAAAC 100 

TCGATATTAC CGCTTTGCGT ACCGCACTGG CGGAGACAGG TTATAAGTTT 150 

TATCCGCTGA TGATTTACCT GATCTCC CGG GCTGTTAATC AGTTTCCGGA 200 

GTTCCGGATG GCACTGAAAG ACAATGAACT TATTTACTGG GACCAGTCAG 250 

ACCCGGTCTT TACTGTCTTT CATAAAGAAA CCGAAACATT CTCTGCACTG 300 

TCCTGCCGTT ATTTTCCGGA TCTCAGTGAG TTTATGGCAG GTTATAATGC 350 

GGTAACGGCA GAATATCAGC ATGATACCAG ATTGTTTCCG CAGGGAAATT 4 00 

TACCGGAGAA TCACCTGAAT ATATCATCAT TACCGTGGGT GAGTTTTGAC 450 

GGATTTAACC TGAACATCAC CGGAAATGAT GATTATTTTG CCCCGGTTTT 500 

TACGATGGCA AAGTTTCAGC AGGAAGGTGA CCGCGTATTA TTACCTGTTT 550 

CTGTACAGGT TCATCATGCA GTATGTGATG GCTTTCATGC AGCACGGTTT 600 

ATTAATACAC TTCAGCTGAT GTGTGATAAC ATACTGAAAT AA 642 



2) INFORMATION FOR SEQ ID NO: 2151 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2151 
ATATTTCAGC ATTACCTTGG GTT 23 



SEQUENCE DESCRIPTION: SEQ ID NO: 2152 



2) INFORMATION FOR SEQ ID NO: 2152 

(i) SEQUENCE CHARACTERISTICS:, 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) 

TACACAACTC TTGTAGCCGA TTA 



2) INFORMATION FOR SEQ ID NO: 2153 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



23 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella flexineri 
(C) ACCESSION NUMBER: X07848 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2153 

ATGAACTATA CAAAATTTGA TGTAAAAAAT TGGGTTCGCC GTGAGCATTT 50 

TGAGTTTTAT CGGCATCGTT TACCATGTGG TTTTAGCTTA ACAAGCAAAA 100 

TTGATATCAC GACGTTAAAA AAGTCATTGG ATGATTCAGC GTATAAGTTT 150 

TATCCGGTAA TGATCTATCT GATTGCTCAG GCCGTGAATC AATTTGATGA 200 

GTTGAGAATG GCGATAAAAG ATGATGAATT GATCGTATGG GATTCAGTCG 250 

ACCCACAATT CACCGTATTC CATCAAGAAA CAGAGACATT TTCAGCACTG 300 

AGTTGCCCAT ACTCATCCGA TATTGATCAA TTTATGGTGA ATTATTTATC 350 

GGTAATGGAA CGTTATAAAA GTGATACCAA GTTATTTCCT CAAGGGGTAA 400 

CACCAGAAAA TCATTTAAAT ATTTCAGCAT TACCTTGGGT TAATTTTGAT 450 

AGCTTTAATT TAAATGTTGC TAATTTTACC GATTATTTTG CACCCATTAT 500 

AACAATGGCA AAATATCAGC AAGAAGGGGA TAGACTGTTA TTGCCGCTCT 550 

CAGTACAGGT TCATCATGCA GTTTGTGATG GCTTCCATGT TGCACGCTTT 600 

ATTAATCGGC TACAAGAGTT GTGTAACAGT AAATTAAAAT AA 642 
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2) INFORMATION FOR SEQ ID NO: 2154 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2154 
CGCCATTCAG AGTTTAGGAC 20 



2) INFORMATION FOR SEQ ID NO: 2155 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2155 
TTCCATACCG TTGCGTATCA CTT 23 



2) INFORMATION FOR SEQ ID NO: 2156 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: CP590 

(C) ACCESSION NUMBER: M74769 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2156 

ATGGTATTTG AAAAAATTGA TAAAAATAGT TGGAACAGAA AAGAGTATTT 50 

TGACCACTAC TTTGCAAGTG TACCTTGTAC ATACAGCATG ACCGTTAAAG 100 

TGGATATCAC ACAAATAAAG GAAAAGGGAA TGAAACTATA TCCTGCAATG 150 

CTTTATTATA TTGCAATGAT TGTAAACCGC CATTCAGAGT TTAGGACGGC 200 

AATCAATCAA GATGGTGAAT TGGGGATATA TGATGAGATG ATACCAAGCT 250 
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ATACAATATT TCACAATGAT ACTGAAACAT 
TGTAAGTCTG ACTTTAAATC ATTTTTAGCA 
ACGGTATGGA AACAATCATA GAATGGAAGG 
ACATTTTTAA TGTATCTATG ATACCGTGGT 
CTGAATTTGC AGAAAGGATA TGATTATTTG 
GAAATATTAT AAAGAAGATA ACAAAATTAT 
TTCATCACGC AGTATGTGAC GGATTTCACA 
TTGCAGGAAT TGATAAATAG TTAA 



PCT/CA00/01150 

TTTCCAGCCT TTGGACTGAG 300 

GATTATGAAA GTGATACGCA 350 

AAAGCCAAAT GCTCCGGAAA 400 

CAACCTTCGA TGGCTTTAAT 450 

ATTCCTATTT TTACTATGGG 500 

ACTTCCTTTG GCAATTCAAG 550 

TTTGCCGTTT TGTAAACGAA 600 

624 



2) INFORMATION FOR SEQ ID NO: 2157 

(i) SEQUENCE CHARACTERISTICS:, 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2157 
CCACAGAAAT TGATATTAGT GTTTTAT 27 



2) INFORMATION FOR SEQ ID NO: 2158 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2158 
TCGCTATTGT AACCAGTTCT A '21 



2) INFORMATION FOR SEQ ID NO: 2159 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 651 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: V01277 
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(xi) SEQUENCE DESCRIPTION: SEQ 

ATGAACTTTA ATAAAATTGA TTTAGACAAT 
TAATCATTAT TTGAACCAAC AAACGACTTT 
ATATTAGTGT TTTATACCGA AACATAAAAC 
CCTGCATTTA TTTTCTTAGT GACAAGGGTG 
TAGAACTGGT TACAATAGCG ACGGAGAGTT 
AGCCACTTTA TACAATTTTT GATGGTGTAT 
TGGACTCCTG TAAAGAATGA CTTCAAAGAG 
TGATGTAGAG AAATATAATG GTTCGGGGAA 
TACCTGAAAA TGCTTTTTCT CTTTCTATTA 
GGGTTTAACT TAAATATCAA TAATAATAGT 
TACAGCAGGA AAATTCATTA ATAAAGGTAA 
CTTTACAGGT ACATCATTCT GTTTGTGATG 
ATGAACTCTA TTCAGGAATT GTCAGATAGG 
A 



PCT/CAOO/01150 

ID NO: 2159 

TGGAAGAGAA AAGAGATATT 50 

TAGTATAACC ACAGAAATTG 100 

AAGAAGGATA TAAATTTTAC 150 

ATAAACTCAA ATACAGCTTT 200 

AGGTTATTGG GATAAGTTAG 250 

CTAAAACATT CTCTGGTATT 3 00 

TTTTATGATT TATACCTTTC 350 

ATTGTTTCCC AAAACACCTA 400 

TTCCATGGAC TTCATTTACT 4 50 

AATTACCTTC TACCCATTAT 500 

TTCAATATAT TTACCGCTAT 550 

GTTATCATGC AGGATTGTTT 600 

CCTAATGACT GGCTTTTATA 650 

651 



2) INFORMATION FOR SEQ ID NO: 2160 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2160 
TTTTGAACAC TATTTTAACC AGC 23 



2) INFORMATION FOR SEQ ID NO: 2161 

(i) SEQUENCE CHARACTERISTICS: 

(A) ' LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2161 
GATTTAACTT ATCCCAATAA CCT 23 



2) INFORMATION FOR SEQ ID NO: 2162 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: X02166 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2162 



ATGACTTTTA ATATTATCAA 
TGAACACTAT TTTAACCAGC 
ATATTACTTT GTTTAAAGAT 
CCTTCTTTGA TTTATGCAAT 
TAGAACAGGA ATTAATAGTG 
ATCCTTTGTA TACAGTTTTT 
TGGACTGAAT CTGATAACAA 
TGACTTGCTT GAATATAAAG 
TACCTGAAAA CACCATACCG 
TCATTTAATT TAAACATTGG 
TACGATAGGT AAATTTTATA 
CTTTGCAGCT TCATCATGCT 
ATGAATGAAT TTCAAGATAT 



ATTAGAAAAT TGGGATAGAA 
AAACTACGTA TAGCATTACT 
ATGATAAAAA AGAAAGGATA 
TATGGAAGTT, GTAAATAAAA 
AGAATAAATT AGGTTATTGG 
AATAAGCAAA CTGAAAAATT 
CTTCACTTCT TTTTATAATA 
ATAAAGAAGA AATGTTTCCT 
ATTTCAATGA TTCCTTGGAT 
TAACAATAGC AACTTTTTAT 
GTGAGAATAA TAAAATTTAT 
GTATGTGATG GTTACCATGC 
AATTCATAAG GTAGATGATT 



AAGAATATTT 50 

AAAGAAATTG 100 

TGAAATTTAT 150 

ATAAAGTGTT 200 

GATAAGTTAA 2 50 

TACTAACATT 300 

ATTATAAAAA 3 50 

AAAAAACCGA 4 00 

TGATTTTAGT 4 50 

TGCCTATTAT 500 

ATACCAGTTG 550 

TTCATTATTT 600 

GGATTTAG 648 



2) INFORMATION FOR SEQ ID NO: 2163 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2163 
ACCTTCATCC TACCGATGTG GGTT 24 



2) INFORMATION FOR SEQ ID NO: 2164 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2164 
CAACGACACC AGCACTGCCA TTG 23 



2) INFORMATION FOR SEQ ID NO: 2165 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1215 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella typhimurium 

(B) STRAIN: H3380 

(C) ACCESSION NUMBER; AF071555 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2165 

ATGACCACCA CACGCCCCGC GTGGGCCTAT ACGCTGCCGG CAGCACTGCT 50 

GCTGATGGCT CCTTTCGACA TCCTCGCTTC ACTGGCGATG GATATTTATC 100 

TCCCTGTCGT TCCAGCGATG CCCGGCATCC TGAACACGAC GCCCGCTATG 150 

ATCCAACTCA CGTTGAGCCT CTATATGGTG ATGCTCGGCG TGGGCCAAGT 200 

GATTTTTGGT CCGCTCTCAG ACAGAATCGG GCGACGGCCA ATTCTACTTG 250 

CGGGCGCAAC GGCTTTCGTC ATTGCGTCTC TGGGAGCAGC TTGGTCTTCA 3 00 

ACTGCACCGG CCTTTGTCGC TTTCCGTCTA CTTCAAGCAG TGGGCGCGTC 350 

GGCCATGCTG GTGGCGACGT TCGCGACGGT TCGCGACGTT TATGCCAACC 4 00 

GTCCTGAGGG TGTCGTCATC TACGGCCTTT TCAGTTCGAT GCTGGCGTTC 4 50 

GTGCCTGCGC TCGGCCCTAT CGCCGGAGCA TTGATCGGCG AGTTCTTGGG 5 00 

ATGGCAGGCG ATATTCATTA CTTTGGCTAT ACTGGCGATG CTCGCACTCC 550 

TAAATGCGGG TTTCAGGTGG CACGAAACCC GCCCTCTGGA TCAAGTCAAG 6 00 

ACGCGCCGAT CTGTCTTGCC GATCTTCGCG AGTCCGGCTT TTTGGGTTTA 650 

CACTGTCGGC TTTAGCGCCG GTATGGGCAC CTTCTTCGTC TTCTTCTCGA 7 00 

CGGCTCCCCG TGTGCTCATA GGCCAAGCGG AATATTCCGA GATCGGATTC 750 

AGCTTTGCCT TCGCCACTGT CGCGCTTGTA ATGATCGTGA CAACCCGTTT 800 

CGCGAAGTCC TTTGTCGTCA GATGGGGCAT CGCAGGATGC GTGGCGCGTG 850 

GGATGGCGTT GCTTGTTTGC GGAGCGGTCC TGTTGGGGAT CGGCGAACTT 900 

TACGGCTCGC CGTCATTCCT CACCTTCATC CTACCGATGT GGGTTGTCGC 950 

GGTCGGTATT GTCTTCACGG TGTCCGTTAC CGCGAACGGC GCTTTGGCAG 1000 

AGTTCGACGA CATCGCGGGA TCAGCGGTCG CGTTCTACTT CTGCATCCAA 1050 

AGCCTGATAG TCAGTATCGT CGGGACATTG GCGGTGACGC TGTTAAACGG 1100 

CGATACAGCG TGGCCCGTGA TTTGTTACGC CACGGCAATG GCAGTGCTGG 1150 

TGTCGTTGGG GCTGGCGCTC CTTCGATCCC GTGATGCTGC CACCGAGAAG 1200 

TCGCCAGTCG TCTAG 1215 



2) INFORMATION FOR SEQ ID NO: 2166 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2166 
GACAAACCAT TCCTGCTG 
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2) INFORMATION FOR SEQ ID NO: 2167 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 2167 
CAGCAGCTGG GCGGCGGT 18 



2) INFORMATION FOR SEQ ID NO: 2168 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2168 
CATCAAAGTT GGTGAAGAAG TTG 23 



2) INFORMATION FOR SEQ ID NO: 2169 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2169 
CCCGTTTGCG AAAGGTGG 18 



2) INFORMATION FOR SEQ ID NO: 2170 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2170 
ACGTGACGTT GACAAACCA 19 



2) INFORMATION FOR SEQ ID NO: 2171 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases' 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY:. Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2171 
TCGTTGGATT AACTGAAGAA 20 
2) INFORMATION FOR SEQ ID NO: 2172 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2172 
GTGTTGAAAT GTTCCGTAAA 20 



2) INFORMATION FOR SEQ ID NO: 2173 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2173 
AAGAAAAAAT CTTCGAACTG GCTA 24 



2) INFORMATION FOR SEQ ID NO: 2174 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2174 
TCTACACGGC CGGTG 15 



2) INFORMATION FOR SEQ ID NO: 2175 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2175 
CCGCCATACC CCGTTT 16 



2) INFORMATION FOR SEQ ID NO: 2176 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2176 
CGGCATTACC ATTTCCACAC CTTT 24 



2) INFORMATION FOR SEQ ID NO: 2177 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2177 

GGCACGGACA AACCATTCCT GCTGCCTATC GAAGACGTGT TCCCGTGCC 



2) INFORMATION FOR SEQ ID NO: 2178 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear . 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:. 2178 
GGCACGACAA ACCATTCCTG CTGCCTATCG AACGTGCC 



2 ) INFORMATION FOR SEQ ID NO: 2179 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2179 
GGCAGCTCTA CTTCCGTACC ACTGACGTAA CCGGCTGCC 



2) INFORMATION FOR SEQ ID NO: 2180 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2180 
TTCGCCGGCG TGGGC 



2) INFORMATION FOR SEQ ID NO: 2181 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2181 
AGCGCCACGC GCAGG 15 



2) INFORMATION FOR SEQ ID NO: 2182 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2182 
GCGCGCCAAC GACTTCTACC ACGAAATGGA AGAGTCGCGC GC 42 
2) INFORMATION FOR SEQ ID NO: 2183 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alcallgenes faecalis subsp. faecalis 

(B) STRAIN: ATCC 8750 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2183 

TATCTTGGTT WGCTCGGCCG CTGACGGCCC AATGCCTCAG ACTCGCGAGC 
ACATCCTGCT GAGCCGTCAG GTTGGCGTTC CTTACATCAT CGTGTTCCTG 
AACAAGGCCG ACATGGTTGA TGACGAAGAG CTGATCGAAC TGGTTGAAAT 
GGAAGTTCGC GAGCTGTTGT CCAAGTACGA CTTCCCTGGC GACGACACCC 
CGATCATCAA GGGTTCGGCC AAACTGGCTC TGGAAGGCGA CGAAGGCCCA 
CTGGGCAGCC AAGCCGTTCT GGCTCTGGCC GAAGCGCTGG ACAACTACAT 
TCCTACGCCT GAGCGTGCCG TTGACGGTAC GTTCCTGATG CCTGTTGAAG 
ACGTGTTCTC GATCTCCGGC CGTGGTACGG TTGTGACCGG TCGTATTGAG 
CGCGGCATCA TCAAGGTCGG CGAAGAAATC GAAATCGTGG GTATCAAAGA 
CACGGTCAAG ACCATTTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 
ACCAGGGCGA AGCTGGCGAT AACGTCGGTC TGCTGCTGCG TGGTACCAAG 
CGTGAAGACG TGGAACGTGG TCAAGTTCTG GCCAAGCCAG GCTCGATCAA 
GCCACACACT GACTTCGACG CCGAGGTGTA CATTCTGTCC AAAGAAGAAG 
GTGGTCGTCA CACTCCTTTC TTCAAGGGCT ACCGTCCTCA GTTCTACTTC 

1120 
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CGTACAACTG ACGTGACCGG CACCATCGAG CTGCCAGAAG ACAAGGAAAT 
GGTTCTGCCA GGCGACAACA TTTCGATGAA AGTGTCCCTG ATCGCTCCTA 
TCGCCATGGA AGAAGGT 



750 
800 
817 



2) INFORMATION FOR SEQ ID NO: 2184 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1652 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter coli 

(B) STRAIN: ATCC 43479 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2184 

TACAATTGAA GTTGAAAGAT CTATGCGTGT TCTTGATGGT GCTGTTGCGG 50 

TATTTTGTTC AGTGGGTGGG GTGCAGCCTC AAAGTGAAAC AGTTTGGAGA 100 

CAAGCAAATA AATATGGTGT TCCAAGAATA GTATTTGTAA ATAAAATGGA 150 

TAGAATCGGT GCAAATTTCT ACAATGTAGA AGATCAAATT CGCAACCGTT 200 

TAAAAGCTAA TCCAGTTCCA CTTCAAATTC CAATCGGTGC TGAGGATAAT 250 

TTTAAAGGCG TAATCGATCT TGTAACTATG AAAGCTTTAG TTTGGGAAGA 300 

TGATACTAAG CCAACGGATT ATGTAGAAAA AGAAATTCCA GCTGAACTTA 3 50 

AAGAAAAGGC AGAAGAATAT CGCACAAAAA TGATAGAAGC AGTTTCTGAA 400 

ACTTCAGATG AGTTGATGGA AAAATATTTA GGTGGAGAAG AATTAAGCCT 4 50 

TGAAGAGATT AAAACAGGGA TTAAAGCAGG ATGTTTAAGT CTTTCTATCG 500 

TTCCTATGCT TTGCGGTACA GCGTTTAAAA ATAAAGGGGT TCAACCTTTG 550 

CTTGATGCTG TTGTGGCTTA TTTACCAGCT CCTGATGAAG TTGCTAATAT 60 0 

CAAAGGGGAA TATGAAGACG GCACAGAAGT TTCTGTAAAA TCAACTGATG 650 

ATGGCGAATT TGCAGGACTT GCATTTAAAA TTATGACAGA TCCATTTGTA 700 

GGACAACTTA CTTTCGTGCG TGTTTATCGT GGATGTTTAG AAAGCGGTTC 750 

TTATGCTTAT AACTCAACCA AAGATAAAAA AGAAAGAATT GGTCGTTTGT 800 

TAAAAATGCA CTCTAATAAA AGAGAAGAAA TTAAAGTTCT TTACGCAGGA 850 

GAAATCGGTG CAGTTGTAGG ACTTAAAGAT ACTTTAACAG GGGATACTCT 900 

TGCAAGTGAA AAAGATAAAG TAATTCTTGA AAGAATGGAT TTCCCAGACC 950 

CAGTTATTTC TGTTGCAGTA GAMCCAAAAA CTAAAGCAGA TCAAGAAAAA 1000 

ATGTCTATTG CACTAAATAA ATTAGCTCAA GAAGATCCAA GCTTTAGAGT . 1050 

TTCTACGGAT GAAGAAAGTG GTCAAACTAT CATTTCAGGT ATGGGTGAAT 1100 

TACACCTTGA AATTATTGTT GATCGTATGC TTCGTGAATT TAAAGTGGAA 1150 

GCTGAAGTGG GACAACCTCA AGTTGCTTAT CGTGAAACTA TCAGAAAAAC 1200 

TGTTGAGCAA GAATACAAAT ACGCTAAGCA ATCAGGCGGT CGTGGTCAGT 1250 

ATGGACATGT ATTCTTACGC CTTGAGCCAC TTGAGCCAGG AAGTGGATAC 1300 

GAGTTTGTTA ACGACATCAA AGGTGGGGTA ATTCCAAAAG AATATATTCC 1350 

TGCAGTAGAT AAGGGTGTTC AAGAAGCATT GCAAAATGGT GTTTTAGCAG 14 00 

GTTATCCAGT AGAAGATGTT AAAGTAACTG TTTATGATGG AAGTTATCAC 1450 

GAGGTGGATT CATCTGAGAT GGCATTTAAA CTTGCTGCTT CTATGGGATT 1500 

TAAAGAGGGT GCTAGAAAAG CAGGTGCTGT GATCTTAGAG CCTATGATGA 1550 

AAGTTGAAGT AGAAACTCCT GAAGATTACA TGGGCGATGT TATTGGCGAT 1600 

CTTAATAAGC GTCGTGGTCA AGTAAATAGC ATGGATGAAA GAGGTGGTAA 1650 

TA 1652 
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2) INFORMATION FOR SEQ ID NO: 2185 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
"(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succinivibrio dextrinosolvens 

(B) STRAIN: ATCC 19716 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2185 

GCTATTCTAG TAGTAGCAGC AACTGATGGT CCTATGCCAC AGACCCGTGA 50 

GCACATCCTA TTAGCACGTC AGGTAGGCGT ACCATACATC ATCGTATTCC 100 

TAAACAAGTG CGATATGGTT GACGACGAGG AATTATTAGA GTTAGTTGAG 150 

ATGGACGTAC GTGATCTATT AAATCAGTAC CAGTTCCCAG GCGACGACAC 200 

TCCAATCATC CGTGGTTCAG CACTAGGTGC ATTAAACGGC GAAGAGAAGT 250 

GGAAAGAGGC AATCTATCAG TTAGCAGACA CTCTAGATTC ATACATTCCA 3 00 

GAGCCAAAGC GTGATATCGA TGATCCATTC CTATTACCAA TCGAAGATAT 3 50 

CTTCTCAATC TCAGGTCGTG GTACTGTAGT AACCGGCCGT GTAGAGCGTG 4 00 

GTATTGTACA CGTAGGTGAC GAAGTTGAAA TCGTTGGTAT TCGTCCAACC 450 

ACCAAGACCA CTGTAACTGG CGTTGAAATG TTCCGTAAGT TACTAGACGA 500 

AGGTCGTGCA GGTGATAACG TTGGTGTTCT ACTACGTGGT ACCAAGCGTG 550 

ATGAGGTTGA GCGTGGTCAG GTTCTAGCTG CTCCAGGCAC AATCACTCCA 600 

CACACCAAGT TCACTGGTCA GGTTTACGTA CTAAGCAAGG ATGAAGGTGG 650 

TCGTCACACT CCATTCTTCA AGGGCTACCG TCCACAGTTC TTCTTCCGTA 700 

CAACCGATAT TACCGGTTCT ATCGATCTGA AAGAGGGCGT AGAGATGGTA 750 

ATGCCAGGTG ATAACACCGA CATGACCGTA ACCCTAATCC ACCCAGTAGC 80 0 

TATGGCTGAA GGCGAGAGAT 820 



2) INFORMATION FOR SEQ ID NO: 2186 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2186 
ACGCGCTCAA AGCAGAAGTA TACGTATTAT CAAAAGACGC GCGT 



2) INFORMATION FOR SEQ ID NO: 2187 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1612 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN: ATCC 33292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2187 

CAGTGGGTGG GGTGCAGCCT CAAAGTGAAa' CAGTTTGGAG ACAAGCAAAT ' 50 

AAATATGGTG TTCCAAGAAT AGTATTTGTA AACAAAATGG ATAGAATCGG 100 

TGCAAATTTC TACAATGTAG AAGATCAAAT TCGCAACCGT TTAAAAGCTA 150 

ATCCAGTTCC ACTTCAAATT CCAATCGGTG CTGAGGATAA TTTTAAAGGC 200 

GTAATCGATC TTGTAACTAT GAAAGCTTTA GTTTGGGAAG ATGATACTAA 250 

GCCAACGGAT TATGTAGAAA AAGAAATTCC AGCTGAACTT AAAGAAAAGG 300 

CAGAAGAATA TCGCACAAAA ATGATAGAAG CAGTTTCTGA AACTTCAGAT 350 

GAGTTGATGG AAAAATATT T AGGCGGAGAA GAATTAAGCC TTGAAGAGAT 4 00 

TAAAACAGGG ATTAAAGCAG GATGTTTAAG TCTTTCTATC GTTCCTATGC 450 

TTTGCGGTAC AGCGTTTAAA AATAAAGGGG TTCAACCTTT ACTTGATGCT 500 

GTTGTGGCTT ATTTACCAGC TCCTGATGAA GTGGCAAATA TCAAGGGTGA 550 

ATATGAAGAT GGCACAGAAG TTTCTGTAAA ATCAACTGAT GATGGCGAGT 600 

TTGCAGGACT TGCATTTAAA ATTATGACAG ATCCATTTGT AGGACAACTT 650 

ACTTTCGTGC GTGTTTATCG CGGTTGTTTA GAAAGTGGTT CTTATGCTTA 700 

TAACTCAACT AAAGATAAAA AAGAAAGAAT TGGTCGTTTG TTAAAAATGC 750 

ACTCTAACCA AAGAGAAGAG ATTAAAGTGC TTTACGCAGG CGAAATTGGT 800 

GCTGTTGTAG GACTTAAAGA TACTTTAACA GGGGATACTC TTGTAAGTGA 850 

AAAAGATAAG GTAATCCTTG AAAGAATGGA TTTTCCAGAT CCAGTTATTT 900 

CTGTTGCAGT TGAGCCAAAA ACTAAAGCAG ATCAAGAAAA AATGTCTATT 950 

GCTTTAAATA AATTAGCACA AGAAGATCCA AGTTTTAGAG TTTCTACAGA 1000 

TGAAGAAAGT GGCCAAACTA TCATTTCAGG TATGGGTGAG TTACACCTTG 1050 

AAATTATCGT TGATAGAATG CTTCGTGAAT TTAAAGTTGA AGCTGAAGTA 1100 

GGTCAACCAC AAGTTGCTTA TCGCGAAACT ATTAGAAAAA CTGTTGAACA 1150 

AGAATACAAA TACGCTAAAC AATCAGGTGG TCGTGGTCAG TATGGACATG 1200 

TATTCTTACG CCTTGAACCA CTTGAGCCAG GTAGTGGATA TGAATTTGTT 1250 

AATGATATCA AAGGTGGAGT AATTCCAAAA GAATACATTC CTGCAGTTGA 1300 

TAAAGGTGTT CAAGAAGCAT TACAAAATGG TGTTTTAGCA GGTTATCCTG 1350 

TGGAAGATGT TAAAGTAACT GTTTATGATG GAAGTTATCA CGAGGTGGAT 14 00 

TCATCTGAGA TGGCGTTTAA ACTTGCTGCT TCTATGGGCT TTAAAGAAGG 14 50 

TGCTAGAAAA GCAGGTGCTG TGATCTTAGA GCCTATGATG AAAGTTGAAG 1500 

TAGAAACTCC TGAAGATTAC ATGGGTGATG TTATTGGAGA TCTTAACAAA 1550 

CGCCGTGGTC AAGTAAATAG CATGGATGAG CGTGGTGGAA ATAAAATCAT 1600 

CACAGCATTT TG 1612 



2) INFORMATION FOR SEQ ID NO: 2188 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1667 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN: ATCC 33560 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2188 

ACAATTGAAG TTGAAAGATC TATGCGTGTT CTTGATGGTG CTGTTGCGGT 50 
ATTTTGTTCA GTGGGTGGGG TGCAGCCTCA AAGTGAAACA GTTTGGAGAC 100 
AAGCAAATAA ATATGGTGTT CCAAGAATAG TATTTGTAAA TAAAATGGAT 150 
AGAATCGGTG CAAATTTCTA CAATGTAGAA GATCAAATTC GCAACCGTTT 200 
AAAAGCTAAT CCAGTTCCAC TTCAAATTCC AATTGGTGCT GAGGATAATT 2 50 

TTAAAGGCGT AATCGATCTT GTAACTATGA AAGCTTTAGT TTGGGAAGAT 3 00 

GATACTAAGC CAACGGATTA TGTAGAAAAA GAAATTCCAG CTGAACTTAA 350 

AGAAAAGGCA GAAGAATATC GCACAAAAAT GATAGAAGCA GTTTCTGAAA 4 00 

CTTCAGATGA GTTGATGGAA AAATATTTAG GCGGAGAAGA ATTAAGCCTT 450 

GAAGAGATTA AAACAGGGAT TAAAGCAGGA TGTTTAAGTC TTTCTATCGT 500 
TCCTATGCTT TGCGGTACAG CGTTTAAAAA TAAAGGGGTT CAACCTTTGC 550 
TTGATGCTGT TGTGGCTTAT TTACCAGCTC CTGATGAAGT GGCAAATATC 600 

AAGGGTGAAT ATGAAGATGG CACAGAAGTT TCTGTAAAAT CAACTGATGA 650 

TGGCGAGTTT GCAGGACTTG CATTTAAAAT CATGACAGAT CCATTTGTAG 700 

GACAACTTAC TTTCGTGCGT GTTTATCGCG GTTGTTTAGA AAGCGGTTCT 750 

TATGCGTATA ACTCAACTAA AGATAAAAAA GAAAGAATTG GTCGTTTGTT 800 

AAAAATGCAC TCTAACAAAA GAGAAGAGAT TAAAGTGCTT TACGCAGGCG 850 

AAATTGGTGC TGTTGTAGGA CTTAAAGATA CTTTAACAGG GGATACTCTT 900 

GCAAGTGAAA AAGATAAGGT AATCCTTGAA AGAATGGATT TTCCAGATCC 950 

AGTTATTTCT GTTGCAGTTG AGCCAAAAAC TAAAGCTGAT CAAGAAAAAA 10 00 

TGTCTATTGC TTTAAATAAA TTAGCACAAG AAGATCCAAG TTTTAGAGTT 1050 

TCTACAGATG AAGAAAGTGG TCAAACTATC ATTTCAGGTA TGGGTGAGTT 1100 

ACACCTTGAA ATTATCGTTG ATAGAATGCT TCGTGAATTT AAAGTTGAAG 1150 

CTGAAGTAGG TCAACCACAA GTTGCTTATC GCGAAACTAT TAGAAAAACT 1200 

GTTGAACAAG AATACAAATA CGCTAAACAA TCAGGTGGTC GTGGTCAGTA 1250 

TGGACATGTA TTCTTACGCC TTGAACCACT TGAGCCAGGT AGTGGATATG 13 00 

AATTTGTTAA TGATATCAAA GGTGGGGTAA TTCCAAAAGA ATACATTCCT 1350 

GCAGTTGATA AAGGTGTTCA AGAAGCATTA CAAAATGGTG TTTTAGCAGG 14 00 

TTATCCTGTG GAAGATGTTA AAGTAACTGT TTATGATGGA AGTTATCACG 1450 

AGGTGGATTC ATCTGAGATG GCGTTTAAAC TTGCTGCTTC TATGGGCTTT 1500 

AAAGAAGGTG CTAGAAAAGC AGGCGCTGTG ATCTTAGAGC CTATGATGAA 15 50 

AGTTGAAGTA GAAACTCCTG AAGATTATAT GGGTGATGTT ATTGGAGATC 1600 

TTAACAAACG CCGTGGTCAA GTAAATAGCA TGGATGAGCG TGGTGGAAAT 1650 

AAAATCATCA CAGCATT 1667 



2) INFORMATION FOR SEQ ID NO: 2189 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1255 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Leishmania 

(B) STRAIN: ATCC 50126 

(xi) SEQUENCE DESCRIPTION: SEQ 

TGACGGCGCT GGACGTGACG GAGGACCTCG 
CTGGAGATCG TGCAGCACTT GGATGCGAAC 
GCAGACGACG GACCTGCTGA AGCTGAAGTC 
GCAACATCTC CGTGCCGGTG GGCCGCGAGA 
GTGCTGGGCG ACGCGATTGA CCACCGCGGC 
CATGGCGATC CACGCCGAGG CGCCGAAGCT 
ACACGATCCT GACGACCGGC ATCAAGGTGA 
TGCAAGGGCG GCAAGATCGG CCTGTTCGGC 
TGTGATCATC ATGGAGCTGA TCAACAACGT 
TCTCCGTGTT CGCCGGCGTT GGCGAGCGCA 
TACCTGGAGA TGATGCAGTC AAAGGTGATT 
GTGCGTGCTT GTGTACGGGC AGATGAACGA 
GCGTTGCGCA GTCTGCGCTG ACGATGGCCG 
GGCCAGAACG TGCTGCTGTT CATCGACAAC 
GAACTCCGAG GTGTCTGCGC TGCTGGGCCG 
ACCAGCCGAC GCTTGCGGAG GATCTTGGCA 
TCGACGACGA AGGGATCGAT TACGTCTGTG 
GGATGATATC ACGGACCCCG CGCCCGCGAC 
CGACGACTGT GCTGGACCGC GCGGTGGCGG 
GTGAACCCGC TGGAGTGCGC GTCGCGCATC 
CGTGGACCAT TACAACGTTG CACAGGATAT 
ACAAGGAGCT GCAGGACATC ATTGCGGTGC 
GAGGAGGACA AGGTCGTGGT GGACCGCGCG 
GTCGCAGCCG TTCCAGGTTG CGGAGGTGTT 
ACGTGCAGCT GAGCGACACG GTGGAGTCGT 
TCGTA 



guyanensis 
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GCCGCGATGA 


GCCGCTGACG 


50 


ACCGGCCGCT 


GCATTGCGAT 


100 


GAAGGTTGTG 


TCGACCGGCG 


150 


CACTGGGCCG 


CATCTTCAAC 


200 


CCCGTGTGCG 


AGAAGATGCG 


250 


GGCGGACCAG 


GCTGCGGAGG 


300 


TCGACCTGAT 


TCTGCCCTAC 


350 


GGTGCCGGTG 


TGGGCAAGAC 


400 


CGCGAAGGGG 


CACGGCGGCT 


450 


CGCGCGAGGG 


CACGGACCTG 


500 


GACCTGAAGG 


GCGAGTCGAA 


550 


GCCCCCGGGT 


GCGCGCGCGC 


600 


AGTACTTCCG 


CGACGTGGAG 


650 


ATCTTCCGCT 


TCACGCAGGC 


700 


CATCCCGGCC 


GCCGTGGGTT 


750 


TGCTGCAGGA 


GCGCATTACG 


800 






850 

V J V 


GACGTTCTCG 


CACCTGGACG 


900 


AGTCGGGCAT 


CTACCCTGCC 


950 


ATGGACCCCG 


ATGTGATCGA 


1000 


CGTCCAGATG 


CTGACCAAGT 


1050 


TTGGCATCGA 


CGAGCTGAGC 


1100 


CGCAAGGTGA 


CGCGGTTCCT 


1150 


CACCGGCATG 


ACGGGCCACT 


1200 


TCTCTGGCCT 


GCTGATGGGG 


1250 






1255 



2) INFORMATION FOR SEQ ID NO: 2190 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1248 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei subsp. brucei 

(B) STRAIN: EATRO 795 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2190 

GCCCTTGACG TTGTTGACAA ACTTGGCCGT GATGAGCCTC TTACTCTTGA 50 

GATCGTGCAG CATCTTGACG CCCACACGGG CCGCTGTATC GCGATGCAAA 100 

CGACGGATCT CCTCAAACTG AAGGCAAAGG TCGTTTCGAC AGGTGGCAAC 150 

ATTTCCGTTC CTGTCGGCCG GGAAACACTA GGTCGTATCT TCAACGTGCT 2 00 

TGGAGACGCT ATTGACCAGC GCGGCCCCGT TGGTGAGAAA CTGCGCATGC 250 

CCATCCATGC CGTGGCTCCC AAGCTTGCGG ACCAGGCCGC TGAGGATGCG 3 00 

GTGCTCACAA CTGGTATTAA GGTGATTGAT CTCATTCTCC CTTACTGCAA 350 
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AGGTGGAAAA ATTGGCCTCT TTGGGGGTGC GGGTGTGGGC AAAACCGTCA 4 00 

TTATTATGGA GCTCATTAAC AACGTTGCCA AGGGTCACGG TGGTTTCTCT 4 50 

GTCTTCGCTG GTGTTGGTGA GCGTACCCGT GAGGGAACGG ATTTGTATCT 500 

TGAGATGATG CAGTCTAAGG TTATTGACCT TAAGGGTGAG TCCAAATGTG 550 

TGTTGGTGTA CGGTCAGATG AACGAGCCCC CAGGTGCCCG TGCGCGTGTT 600 

GCGCAGTCGG CTCTGACGAT GGCTGAGTAC TTCCGTGATG TGGAGGGCCA 650 

AGATGTGCTT CTTTTTATCG ACAATATTTT TCGTTTCACT CAGGCTAACT 700 

CCGAGGTGTC GGCGCTTCTG GGTCGTATTC CCGCCGCTGT TGGCTACCAG 750 

CCTACCCTCG CTGAGGATCT AGGGCAGTTG CAGGAGCGTA TTACCTCAAC 800 

AACGAAAGGC TCCATTACTT CTGTGCAGGC CGTATACGTG CCGGCCGATG 850 

ACATTACCGA TCCAGCTCCA GCAACAACCT TCTCACATCT GGACGCCACA 900 

ACTGTGTTGG ACCGTGCTGT TGCCGAGTCT GGTATCTACC CCGCTGTTAA 950 

CCCACTGGAA TGCGCCTCGC GTATCATGGA CCCCGACGTT ATCAGTGTGG 1000 

ATCACTACAA TGTTGCACAA GATGTGGTAC AGATGCTCAC CAAGTACAGG 1050 

GAATTACAGG ATATCATTGC TGTCCTTGGT ATCGACGAGC TAAGCGAGGA 1100 

GGACAAACTT ATCGTGGACC GTGCGCGTAA GTTGGTGAAG TTCCTCTCCC 1150 

AGCCATTCCA AGTTGCTGAG. GTCTTCACAG GAATGACTGG CCATTACGTG 1200 

CAGTTGGATG ACACCATCGA TTCCTTTTCT GGTCTCCTCA TGGGTACG 1248 



2) INFORMATION FOR SEQ ID NO: 2191 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 440 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus 

(B) STRAIN: WSA-176 

(xi) SEQUENCE DESCRIPTION: SEQ 

GAGCTGATTG TAAGTCATCG ACCACTTGAT 
AGTAGAACAA CATCGCCAAG GCTCACGGTG 
GTCGGTGAGC GTACTCGTGA GGGTAACGAT 
GACTGGTGTC ATTCAGCTCG ACGGCGAATC 
GTCAGATGAA CGAGCCCCCA GGTGCTCGTG 
CTGACCATCG CCGAATACTT CCGTGACGAG 
CTTCATTGAC AACATTTTCC GTTTCACCCA 
CCCTTCTTGG TCGTATCCCC TCTGCCGTCG 
GTCGACATGG GTGGTATGCA GGAACGTATT 



nidulans 
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ATATGAATAC 


ATCTAACAGT 


50 


GTTACTCCGT 


CTTCACTGGT 


100 


CTGTACCACG 


AAATGCAGGA 


150 


CAAGGTGTCT 


CTTGTGTTCG 


200 


CCCGTGTCGC 


CCTTACTGGT 


250 


GAGGGTCAGG 


ACGTGCTGCT 


300 


GGCCGGTTCC 


GAGGTGTCTG 


350 


GTTACCAGCC 


CACTCTGGCC 


400 


ACCACCACCA 




440 



2) INFORMATION FOR SEQ ID NO: 2192 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1262 bases 
-(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania panamensis 

(B) STRAIN: ATCC 50158 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2192 

GCTGACGGCG CTGGACGTGA CGGAGGACCT CGGCCGCGAT GAGCCGCTGA 50 

CGCTGGAGAT CGTGCAGCAC TTGGATGCGA ACACCGGCCG CTGCATTGCG 100 

ATGCAGACGA CGGACCTGCT GAAGCTGAAG TCGAAGGTTG TGTCGACCGG 150 

CGGCAACATC TCCGTGCCGG TGGGCCGCGA GACGCTGGGC CGCATCTTCA 200 

ACGTGCTGGG CGACGCGATT GACCACCGCG GCCCCGTGTG CGAGAAGATG 250 

CGCATGGCGA TCCACGCCGA GGCGCCGAAG CTGGCGGACC AGGCTGCGGA 300 

GGACACGATC CTGACGACCG GCATCAAGGT GATCGACCTG ATTCTGCCCT 350 

ACTGCAAGGG CGGCAAGATC GGCCTGTTCG GCGGTGCCGG TGTGGGCAAG 4 00 

ACTGTGATCA TCATGGAGCT GATCAACAAC GTCGCGAAGG GGCACGGCGG 4 50 

CTTCTCCGTG TTCGCCGGCG TTGGCGAGCG CACGCGCGAG GGCACGGACC 500 

TGTACCTGGA GATGATGCAG TCAAAGGTGA TTGACCTGAA GGGCGAGTCG 550 

AAGTGCGTGC TTGTGTACGG GCAGATGAAC GAGCCCCCGG GTGCGCGCGC 600 

GCGCGTTGCG CAGTCTGCGC TGACGATGGC CGAGTACTTC CGCGACGTGG 650 

AGGGCCAGAA CGTGCTGCTG TTCATCGACA ACATCTTCCG CTTCACGCAG 700 

GCGAACTCCG AGGTGTCTGC GCTGCTGGGC CGCATCCCGG CCGCCGTGGG 750 

TTACCAGCCG ACGCTTGCGG AGGATCTTGG CATGCTGCAG GAGCGCATTA 800 

CGTCGACGAC GAAGGGATCG ATTACGTCTG TGCAGGCTGT GTACGTGCCT 850 

GCGGATGATA TCACGGACCC CGCGCCCGCG ACGACGTTCT CGCACCTGGA 900 

CGCGACGACT GTGCTGGACC GCGCGGTGGC GGAGTCGGGC ATCTACCCTG 950 

CCGTGAACCC GCTGGAGTGC GCGTCGCGCA TCATGGACCC CGATGTGATC 1000 

GACGTGGACC ATTACAACGT TGCACAGGAT ATCGTCCAGA TGCTGACCAA 1050 

GTACAAGGAG CTGCAGGACA TCATTGCGGT GCTTGGCATC GACGAGCTGA 1100 

GCGAGGAGGA CAAGGTCGTG GTGGACCGCG CGCGCAAGGT GACGCGGTTC 1150 

CTGTCGCAGC CGTTCCAGGT TGCGGAGGTG TTCACCGGCA TGACGGGCCA 1200 

CTACGTGCAG CTGAGCGACA CGGTGGAGTC GTTCTCTGGC CTGCTGATGG 1250 

GGTCGTACGA CC 1262 



2) INFORMATION FOR SEQ ID NO: 2193 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN: WSA-176 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2193 

TTCCGATGGT CAAATGTACG ATTGATATTC CTTCCAGCCA GTCAGGATAA 50 

CAGCTGATAC CAGTTGCAAA TAGGCCCCAG ACTCGTGAGC ACTTGTTGCT 100 

TGCCCGTCAG GTTGGTGTCC AGAAGATCGT TGTCTTCGTC AACAAGGTTG 150 

ACGCTGTCGA TGACCCTGAG ATGTTGGAGC TTGTTGAGCT CGAGATGCGT 200 
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GAGCTCCTCA 
CGGTTCCGCC 
AGCGTATTGA 
CAGCGTGACT 
CATTGGTGGT 
TCAAGAAGGA 
ACCAAGGTCA 
TGCTGGTGAC 
TTCGTCGTGG 
AAGTTCATGG 
GAACGGCTTC 
GTAAGTTTCG 
AGACGAGGCT 
ACGTCATGCC 
GTTGCTGCTG 



ACACTTACGG 
CTGTGCGCTC 
CTCCCTCCTC 
TGGACAAGCC 
CGTGGTACCG 
TACCGAAGTT 
CCGACATTGA 
AACTCCGGTC 
TATGGTCATC 
TCTCCATGTA 
GGTGCCAACT 
AACTATTTGA 
TGCGACCTTC 
CGGTGACAAC 
AG 



TTTCGAGGGA 
TCGAAGGCCG 
GAGGCCGTTG 
CTTCCTGATG 
TCGCCTCTGG 
GAAATTCACG 
GACCTTCAAG 
TTCTCCTCCG 
GCTGCCCCTG 
CGTCTTGACT 
ACCGCCCCCA 
TTCATTGATC 
ATTTCCCTGA 
GTCGAAATGG 



GAGGAGACCC CTATCATCTT 
CCGCGAGGAC ATTGGTACTC 
ACACTTGGAT CCCTACCCCC 
TCCATTGAGG AAGTTTTCTC 
TCGTGTCGAG CGTGGTCTCC 
GTGCTGATGG TATTCTGAAG 
AAGAGCTGCG ATGAGTCTCG 
TGGTATCCGT CGTGAGGATG 
GCTCCATCAA GGCCTCCAAG 
GAGGCTGAAG GTGGCCGCAA 
GGCTTTCATC CGCACTGCTG 
ACGTCCCTAA CTGTTACTTT 
TGAGGCCGAC AAGGACCGCC 
TCCTCAACCT CAACAACCCC 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
912 



2) INFORMATION FOR SEQ ID NO: 2194 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 887 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aureobasidium pullulans 

(B) STRAIN: WSA-234 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2194 

TCATCGTCGT CGCCGCTTCT GACGGTCAGA TGCCCCAGAC CAGAGAGCAC 50 

TTGCTTCTCG CCCGTCAGGT CGGTATCCAG AAGATTGTCG TCTTCGTCAA 100 

CAAGGTCGAC GCCATCGAGG ACAAGGAGAT GCTTGAGCTC GTCGAGATGG 150 

AGATGCGTGA GCTCCTCAGC AGCTACGGCT TCGAGGGTGA CGAGACCCCC 200 

ATCGTCATGG GTTCCGCTCT TTGTGCCCTT GAGAACCGCC AGCCCGAGAT 2 50 

TGGAACCACC CAGATCGACA ACCTGATGAA CGCTGTCGAC GAGTGGATCC 3 00 

CCACTCCCCA GAGAGATCTT GAGAAGCCCT TCCTCATGTC CGTTGAGGAT 350 

GTCTTCTCTA TCCCCGGTCG TGGTACTGTC GTTTCTGGCC GTGTTGAGCG 400 

TGGTACCCTG AAGAAGGATT CCGAAATCGA GCTTGTCGGC AAGAACAAGG 450 

TCCCCATCAA GACCAAGGTC ACCGACATCG AGACCTTCAA GAAGTCTTGC 500 

GACGAGTCCC GCGCTGGTGA CAACTCCGGT CTTCTGCTCC GTGGTATCAA 550 

GCGTGAGGAT GTCAACCGTG GTATGGTTGT CGTCAAGCCC GGTACCGTCA 600 

CCTCGCACAA GAAGTTCCTC GTCTCCATGT ACGTCCTGAC CAAGGAGGAG 650 

GGTGGTCGTC ACACTGGTTT CCACGGAAAC TACCGTCCCC AGATCTTCAT 7O0 

CCGTACCGCT GGTAAGTCCT GACTTTGAAC TGCTGACCAA TTTTCGCATC 750 

TCTAACATGT TTTACAGACG AGGCTGCCGC TATTGACTGG CCCGAAGGCA 800 

CCGAGGACGC TGACTCCAAG ATGGTCATGC CCGGTGACAA CGTCGAGATG 850 

GTCTGCTCTC TTCACAGACC TCTTGCCGTT GAACAGG 887 



2) INFORMATION FOR SEQ ID NO: 2195 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 984 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Emmons ia pa rva 

(B) STRAIN: ATCC 10784 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2195 

TGACGGCCAA ATGTAGGACT CTTGCGCGGG ACTGACTGAT TGGGGGGAAT 50 

CCATTTTTTC TTTTTTCTTT TTCTTTTTGA GTACATGATT ATACTAATAT 100 

TTGGATAACG TACCAGGCCT CAAACACGTG AACATTTACT CCTTGCCCGA 150 

CAAGTCGGTG TCCAGAAGAT CGTCGTTTTC GTAAACAAGG TCGATGTTCT 2 00 

CGAAGATAAG GAGATGTTGG AGCTTGTCGA GTTGGAAATG AGAGAGCTCT 250 

TGAACACCTA CGGATTCGAG GGTGAGGAAA CGCCCATCAT CTTCGGTTCT 300 

GCCCTTTGTG CCATGGAGGG CCGCGAGCCT GAGCTGGGCG AGAAGAGAAT 350 

TGATGAATTG CTCGACGCTG TTGATAGCTG GATCCCTACG CCGCAACGTG 4 00 

ATACGGAAAA ACCCTTCCTA ATGTCCATTG AGGAAGTGTT CTCCATCTCC 4 50 

GGTCGTGGAA CCGTTGCCTC CGGCCGTGTT GAGCGTGGTG TCCTCAAGAA 500 

GGATTCCGAA GTCGAGCTTG TTGGCGGCGG CGTCGCCCCA ATCAGGACCA 550 

AGGTAACCGA TATCGAAACC TTCAAGAAGT CCTGCGACGA GTCCAGGGCT 6 00 

GGAGACAACT CTGGCCTTTT GTTGCGTGGT GTCAAGCGTG AGGATATCCG 650 

CCGTGGTATG GTCGTTGTCG TTCCTGGCAG CGTCAAGGCC CACGACAAGT 700 

TCTTGGTGTC CATGTATGTT CTGACCGAAG CCGAGGGTGG TCGCCGAACT 750 

GGATTCGGCC AAAACTATCG TCCTCAAATG TTTATCCGCA CAGCTGGTAC 800 

GTAACGTATA ATGCCTCTCC TCTTCATATA TACCACCCCC CCCACCACTG 850 

ACTCCCTGAC TCTTCGATTA CAGACGAGGC CGCCGATCTC AGCTTCCCTG 900 

ACGCAGCAGA CGAAACCAAA CTGGTTATGC CCGGTGACAA CGTCGAGATG 950 

ATCCTCAAGA CACACCGCCC CATAGCTGCC GAAG 984 



2) INFORMATION FOR SEQ ID NO: 2196 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exserohilum rostra turn 

(B) STRAIN: WSA-215 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2196 

GCCGCCTCTG ACGGCCAAAT GCCCCAGACC CGTGAGCACT TGCTGCTTGC 50 

TCGCCAGGTC GGTGTTCAGA AGATTGTTGT TTTTGTCAAC AAGGTCGATG 100 

CCGTCGAGGA CAAGGAGATG TTGGAGCTCG TTGAGATGGA GATGCGTGAG 15 0 

TTGCTCAGCA GCTATGGCTT CGAGGGCGAT GAGACTCCCA TTGTCATGGG 200 
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CTCCGCTCTC TGCGCCATCG AGGGCCGTGA 
GTATTGATGA GCTGCTCGAG GCTGTCGACA 
CGTGATACCG ACAAGCCCTT CCTCATGGCT 
TGCTGGCCGT GGTACCGTCG TTTCTGGCCG 
AGCGCGATGC CGAAGTCGAG CTGGTTGGCA 
ACCAAGGTTA CCGACATTGA AACCTTCAAG 
CGCCGGTGAC AACTCGGGTC TCCTCCTCCG 
TCCGCCGTGG CATGGTCGTT TCTGTTCCTG 
AAGTTCCTCG TCTCCATGTA TGTGTTGAGC 
CACGGGCTTC GGCGAGAACT ACAGGCCACA 
ACGAGTCATG CGCGCTGCAC TGGCCAGAAG 
AAGCTTGTTA TGCCTGGTGA TAACGTTGAG 
GCCACA 



PCT/CAOO/01150 

GCCCGAAATT GGTGTCAACC 250 

CCTGGATCCC CACCCCTCAG 300 

GTTGAGGACG TCTTCTCCAT 350 

TGTCGAGCGA GGTGTCTTGA 4 00 

AGGGCACTGC GCCCATCAAG 450 

AAGTCGTGTG AAGAGTCTCG 500 

TGGTGTCAAG CGTGACGACG 550 

GACAAGTCAA GGCCCACAAG 600 

AAAGAGGAAG GTGGCCGCCA 650 

AATGTTCATC CGTACTGCTG 700 

GTACCCCAGA TGCTCACGAC 750 

ATGGTTTGTG AGCTTCACGT 800 

806 



2) INFORMATION FOR SEQ ID NO: 2197 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium monili forme 

(B) STRAIN: WSA-213 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2197 

ACCCGTGAAC ACTTGCTCCT CGCTCGTCAG GTTGGTGTTC AGCGAATTGT 50 

CGTCTTTGTC AACAAGGTCG ATGCCATTGA TGACCCCGAG ATGCTTGAGC 100 

TCGTCGAGAT GGAGATGCGC GAGCTTCTTA ACACCTATGG CTTCGAAGGC 150 

GACGACACTC CCGTCATCAT GGGCTCGGCT CTCATGTCTC TCCAGAACCA 200 

GCGCCCTGAG ATTGGCACCG AGAAGATTGA TGAGCTTCTT GCTGCCGTCG 250 

ACGAGTGGAT CCCAACCCCC GAGCGTGACC TTGACAAGCC CTTCCTTATG 300 

TCCGTCGAGG ATGTCTTCTC CATTGCTGGC CGTGGTACCG TCGTGTCTGG 350 

CCGTGTGGAG CGTGGTGTTC TGAAGCGTGA CCAGGAGATC GAGCTTGTTG 4 00 

GAAAGGGTCA GGAGGTTATC AAGACCAAGG TTACCGACAT CGAGACCTTC 4 50 

AAGAAGTCTT GTGAGCAGTC CCAGGCTGGT GACAACTCTG GTCTCCTCAT 500 

CCGAGGTGTT CGCCGTGAGG ATGTCCGCCG TGGTATGGTC GTCTGCGCTC 550 

CTGGCACCGT GAAGTCTCAC ACCCAGTTCC TCGCTTCCCT CTATGTCCTC 600 

TCCAAGGAGG AGGGTGGCCG ACACACCGGT TTCCAGGAGC ACTACCGACC 650 

CCAGCTCTAC CTCCGAACCG CAGATGAGTC CATTGACCTG ACTTTCCCTG 700 

AGGGTACTGA GGATGCCTCC AGCAAGATGG TCATGCCTGG CGACAACACC 750 

GAGATGGTTG TCACCATGGG TCACCCCAAT GCCATCGAGG TTGGTCAGC 799 



2) INFORMATION FOR SEQ ID NO: 2198 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium solani 

(B) STRAIN: ATCC 32793 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2198 

CTCTGACGGT CAGATGCCCC AGACCCGTGA GCACTTGCTG CTTGCCCGTC 50 

AGGTCGGTGT CCAGAAGATT GTCGTCTTCG TCAACAAGGT CGATGCCATT 100 

GACGACCCTG AGATGCTTGA GCTCGTCGAG, ATGGAGATGC GTGAGCTCCT 150 

CAACACCTAC GGCTTCGAGG GTGACGAGAC CCCTGTCATC ATGGGCTCTG 200 

CTCTCATGTC CCTCCAGAAC CAGCGCCCCG AGATCGGTAG CCAGAAGATT 250 

GACGAGCTCC TTGCCGCCGT TGACGAGTGG ATCCCTACCC CCGAGCGTGA 3 00 

CCTTGACAAG CCCTTCCTCA TGTCCGTTGA GGATGTCTTC TCCATTGCCG 350 

GCCGTGGTAC CGTCGTCTCT GGCCGTGTCG AGCGTGGTGT CCTGAAGCGC 400 

GACCAGGAGA TTGAGCTCGT CGGCAAGGGT AACGAGGTCA TCAAGACCAA 450 

GGTCACCGAC ATTGAGACCT TCAAGAAGTC TTGCGAGCAG TCCCAGGCTG 500 

GTGACAACTC TGGTCTCCTC ATCCGAGGTG TCCGCCGTGA GGATGTCCGC 550 

CGTGGTATGG TCGTCTGCGC CCCCGGCACT GTCAAGTCCC ACACTCAGTT 600 

CCTTTCTTCC CTCTACGTCC TCACCAAGGA GGAGGGTGGC CGACACACTG 650 

GCTTCCAGGA GCACTACCGA CCCCAGCTCT ACCTCCGAAC TGCTGATGAG 700 

TCCATCGACC TGACCTTCCC CGAGGGTACC GAGGACGCCA GCAGCAAGAT 750 

GGTCATGCCC GGTGACAACA CCGAGATGGT CATCACCATG GGCCACCCCA 800 

ACGCCATTGA GGTCGGTCA 819 



2) INFORMATION FOR SEQ ID NO: 2199 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1025 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: WSA-377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2199 

TGCTGACGGC CAAATGTAAG ACGCCGCGAG GGAGTGCTGA AGGTTTTATG 50 

CTTTTTAGGC CCTTTGTTTC TGAGAGCATG ATGATACTAA TACTCGGAAA 100 

CGTATCTATT AGGCCTCAAA CACGTGAGCA TTTGCTCCTT GCCCGACAGG 150 

TCGGTGTCCA AAAGATCGTC GTTTTCGTGA ACAAAGTCGA CGCCCTTGAG 200 

GACAAGGAGA TGTTGGAGCT TGTCGAGTTA GAAATGAGAG AGCTCTTAAA 250 

CACCTACGGA TTTGAGGGTG AAGAGACACC CATCATCTTT GGTTCTGCCC 300 

TTTGCGCCAT GGAAGGCCGT GAGCCTGAGT TGGGAGAAAA GAAAATTGAT 350 

GAATTGCTGG AGGCTGTTGA TACTTGGATC CCAACACCAC AACGTGATAC 400 

CGAAAAACCT TTCTTGATGT CCGTTGAGGA AGTATTCTCT ATCTCCGGTC 450 

GTGGAACCGT TGCCTCCGGT CGTGTTGAGC GCGGTGTCCT CAAGAAGGAT 500 
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TCAGAAGTCG AGCTAATTGG GGGCGGCTCC 
AACTGATATC GAAACTTTCA AGAAATCCTG 
ACAACTCCGG TCTTTTATTG CGTGGTATCA 
GGTATGGTAG TTGCCGTTCC TGGCAGCGTC 
GGTGTCGATG TATGTCCTGA CCGAAGCTGA 
TCGGCCAGAA CTATCGTCCT CAAATGTTCA 
AAATGGGGCC CCTTTTCATA ATCCTTTCTT 
ATCTCTCTCT CTGTTTCTTT TCAACTCGCC 
CCCCTTTGAT TATAGACGAA GCCGCCCATC 
GATGAAAGCA AACTCGTTAT GCCTGGTGAC 
GACACACCGC CCCGTGGCTG CTGAG 
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ACCCCCATCA GGACGAAGGT 550 

TGACGAGTCT AGAGCTGGGG 600 

AGCGTGAAGA TATCCGCCGT 650 

AAGGCCCACG ACAAGTTCTT 700 

GGGTGGTCGC CGAACCGGAT 750 

TCCGCACAGC TGGTATGTCA 800 

TTTTTCCTTT TCCTCTCTCT 850 

TGATTCACGA AATTAACTAA 900 

TCAGCTTCCC TAGTGGAGCA 950 

AACGTCGAGA TGATCCTCCA 1000 

1025 



2) INFORMATION FOR SEQ ID NO: 2200 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kocuria kristinae 

(B) STRAIN: ATCC 27570 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2200 . 

AGCACGTGCT GCTCGCCCGC CAGGTCGGCG TGCCGACCCT GCTGGTCGCC 50 

CTGAACAAGG CCGACATGGT CGAGGACGAG GAGCTGCTGG ACCTCGTCGA 100 

GATGGAGGTC CGCGAGCTGC TGTCCTCCCA GGAGTTCGAC GGCGACAACG 150 

CCCCCGTCAT CCGCGTCTCC GCGCTGAAGG CGCTGGAGGG CGACGAGAAG 200 

TGGGTCAAGT CCATCGAGGA GCTCATGGAG GCCGTGGACG AGTACATCCC 250 

GGACCCCGTG CGCGACAAGG ACAAGCCGTT CCTGATGCCC ATCGAGGACG 300 

TCTTCACCAT CACCGGGCGC GGCACCGTGG TGACCGGTCG CGCCGAGCGC 350 

GGGACCCTGG CCCTGAACTC CGAGGTCGAG ATCGTCGGCA TCCGCCCGAT 4 00 

CCAGAAGACC ACGGTCACCG GGATCGAGAT GTTCCACAAG CAGCTCGACG 450 

AGGCCTGGGC CGGCG AG AAC TGCGGTCTGC TGCTGCGCGG CCTGAAGCGC 500 

GACGACGTCG AGCGCGGCCA GGTCGTGGTG AAGCCGGGTT CCATCACCCC 550 

GCACACCAAC TTCGAGGCGA ACGTCTACAT CCTGTCCAAG GACGAGGGTG 600 

GGCGTCACAA CCCGTTCTAC TCGAACTACC GTCCGCAGTT CTACTTCCGG 650 

ACCACCGACG TCACCGG 667 



2) INFORMATION FOR SEQ ID NO: 2201 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio mimicus 

(B) STRAIN: ATCC 33653 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2201 

TGCTGCAACA GATGGTCCAA TGCCACAAAC TCGTGAGCAC ATCCTGCTGG 50 

GTCGCCAAGT AGGTATTCCT TACATCATCG TATTCATGAA CAAATGTGAC 100 

ATGGTTGACG ATGAAGAGCT TCTAGAGCTG GTTGAGATGG AAGTTCGTGA 150 

GCTTCTGTCT GAGTACGATT TCCCAGGTGA TGACCTGCCA GTAATCCAAG 200 

GTTCAGCACT AGGCGCGCTA AACGGCGAAG CACAGTGGGA AGCGAAGATT 250 

GTTGAACTAG CAGAAGCACT AGATTCATAC ATTCCAGAGC CAGAGCGTGC 300 

AGTAGACATG GCATTCCTGA TGCCAATCGA AGACGTATTC TCAATCCAAG 350 

GTCGTGGTAC AGTAGTAACT GGCCGTATCG AGCGCGGCAT CCTGAAAGTG 4 00 

GGTGACGAAG TTGCGATCGT TGGTATCAAA GACACAGTAA AAACTACCTG 4 50 

TACAGGTGTA GAAATGTTCC GTAAGCTGCT TGACGAAGGT CGTGCAGGTG 500 

AGAACGTTGG TGCACTGCTA CGTGGTACTA AGCGTGAAGA AGTAGAGCGT 550 

GGTCAAGTAC TGGCGAAGCC AGGTTCAATC ACCCCACACA CTAAGTTCGA 600 

ATCAGAAGTA TACGTACTGT CAAAAGACGA AGGTGGCCGT CATACTCCAT 650 

TCTTCAAAGG TTACCGTCCA CAGTTCTACT TCCGTACAAC TGACGTAACA 700 

GGCAGCATCG AGCTTCCAGA AGGCGTAGAA ATGGTAATGC CAGGCGACAA 750 

CATCAAGATG GTTGTAGACC TGATTGCA 778 



2) INFORMATION FOR SEQ ID NO: 2202 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter freundii 

(B) STRAIN: ATCC 8090 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2202 



AACGCTGACC CTGCAGGTTA TTGCCGCTGC 
GTGCATTTAT CGATGCAGAA CACGCACTGG 
CTTGGCGTTG ATATCGATAA CCTGCTGTGT 
ACAAGCGCTG GAAATCTGTG ATGCACTGGC 
TTATCGTTGT CGACTCCGTT GCCGCATTGA 
GGCGAGATTG GCGACTCTCA CATGGGCCTT 
GGCGATGCGT AAGCTGGCCG GTAACCTGAA 
TTTTCATCAA CCAGATCCGT ATGAAGATTG 
GAAACCACCA CC 



GCAGCGTGAA GGTAAAACCT 50 

ACCCGGTCTA TGCCCGTAAG 100 

TCTCAGCCGG ATACCGGTGA 150 

GCGCTCCGGT GCGGTTGACG 200 

CGCCGAAGGC AGAAATCGAA 250 

GCGGCGCGTA TGATGAGCCA 300 

GCAGTCCAAC ACGCTGCTGA 350 

GCGTTATGTT CGGTAACCCG 4 00 

412 



2) INFORMATION FOR SEQ ID NO: 2203 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 337 bases 
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(B) TYPE: Nucleic* acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium 

(B) STRAIN: 20:1.2 

(xi) SEQUENCE DESCRIPTION: SEQ 

TTTAGATCCA TCTTATGCTA GAAATTTAGG 
TAGTTTCTCA ACCAGATACA GGAGAACAGG 
TTAGTAAGAT CAGGAGCAGT AGATGTTATA 
TTTAGTTCCT AGGGCAGAAA TAGAAGGAGA 
GTCTTCAAGC AAGACTTATG. TCTCAAGCCC 
ATAAATAAAT CTAAGTGTGT AGCTATATTT 
GGTTGGTATA ATGTTTGGAA ATCCAGAAAC 



botulinum 



ID NO: 2203 

TGTTGATATA GATAACCTAA 50 

CTTTAGAGAT AACAGAAGCT 100 

GTTGTAGACT CTGTAGCAGC 150 

AATGGGAGAC TCACATGTAG 200 

TAAGAAAATT AGCAGGATCT 250 

ATAAACCAAT TAAGAGAAAA 300 

AACTCCT 33 7 



2) INFORMATION FOR SEQ ID NO: 2204 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 379 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Francisella tularensis 

(B) STRAIN : ATCC 29684 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2204 

AAAGCAAGGC GGTACTGCAG CATTTGTTGA TGCTGAGCAT GCACTAGATC 50 

CAAAATATGC AAAGCTTTTA GGTGTTGATG TTGATAATCT GATCGTGTCA 100 

CAGCCGGATA CGGGTGAGCA AGCTTTAGAG ATTGCTGATA TGTTGGTACG 150 

TTCTGGAGGA GTTGATATTG TAGTAATTGA CTCTGTTGCT GCACTTACGC 200 

CAAAGGCAGA GATTGAGGGT GACATGGGCG ACTCGCACAT GGGCTTACAA 250 

GCAAGATTAA TGTCACAAGC ACTAAGAAAA CTAACGGCAA ATATCAAGCG 300 

CTCAAATACT CTAGTGATAT TCATTAACCA AATTCGTATG AAGATCGGGG 350 

TTATGTTTGG TAACCCTGAA ACTACAACT 379 



2) INFORMATION FOR SEQ ID NO: 2205 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus anaerobius 

(B) STRAIN: ATCC 27337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2205 



ACTTGACCCA GTATATGCAA GGGCTCTTGG AGTGGATATA GACAACCTAG- 50 

TCATATCTCA GCCAGATACA GGAGAACAGG CCCTAGATAT AGCAGAGTCC- 100 

CTTATAAGAT CAGGAGCTGT AGATATACTA GTAATAGACT CAGTAGCTGC- 150 

CCTAGTACCT AAGGCAGAAA TAGAAGGTGA CATGGGAGAT TCTCACGTAG- 200 

GTCTACAGGC TAGACTTATG TCACAGGCAC TTAGAAAATT GACTGGATCT - 250 

ATAAAGAAGT CAAACTGTGT TGTTATATTT, ATCAACCAGT TGAGAGAAAA- 300 

AGTAGGGGTT ATGTTCGGTA ATCCAGAGAC AACAACA 337 



2) INFORMATION FOR SEQ ID NO: 2206 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus asaccharolyticus 

(B) STRAIN: LSPQ 2639 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2206 



TCTTGATGCT GGATATGCAA AAAACCTTGG AGTAGATGTA GAAAATTTAA- 50 

TTATTTCTCA ACCTGATACA GGTGAGCAAG CCTTAGAAAT AACTGAAGCT- 100 

CTTGTAAGAT CTAACGCTGT TGATTTAATT ATTATAGACT CAGTTGCCGC- 150 

ACTTGTACCA AAAGCAGAAA TCGATGGTGA CATGGGAGCT GCACAAATAG- 200 

GTCTTCAAGC AAGACTTATG TCTCAAGCTC TTAGAAAATT AACTGGGGCA- 250 

ATCAACAAGT CAAAATGTAC CGTTGTATTT ATTAACCAAC TTAGAGAAAA- 300 

AGTTGGTATC ATGTTTGGTA ACCCAGAAAC TACAACA 337 



2) INFORMATION FOR SEQ ID NO: 2207 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia stuartii 
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(B) STRAIN: ATCC -33672 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2207 



CTCACGTTGC AAGTTATTGC 
ATTTATCGAC GCTGAACATG 
GTGTTGATAT CGATAACCTT 
GCATTAGAGA TTTGTGATGC 
TATCGTTGAC TCCGTGGCCG 
AAATCGGTGA CTCACACATG 
ATGCGTAAAT TAGCGGGTAA 
CATTAACCAA ATCCGTATGA 
CCACTACA 



AGCAGCACAA CGTAGCGGAA AAACCTGTGC 
CGCTAGATCC AATCTATGCG AAAAAACTGG 
CTATGTTCTC AACCTGATAC TGGTGAGCAA 
ACTGACGCGT TCAGGCGCTG TTGATGTCAT 
CATTAACACC AAAAGCTGAA ATTGAAGGTG 
GGCTTAGCGG CTCGTATGAT GAGCCAAGCG 
CTTAAAGAAC TCGAATACAC TTTTAATCTT 
AGATTGGCGT TATGTTTGGT AACCCAGAAA 



50 
100 
150 
200 
250 
300 
350 
400 
408 



2) INFORMATION FOR SEQ ID NO: 2208 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Paratyphi A 

(B) STRAIN: ATCC 9150 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2208 

GCTGCAGGTG ATTGCCGCTG CGCAGCGTGA AGGTAAAACC TGTGCGTTTA 50 

TCGATGCGGA ACACGCGCTT GACCCTGTTT ACGCACGCAA GCTGGGCGTC 100 

GATATCGATA ACCTGCTTTG TTCTCAGCCG GATACCGGCG AGCAGGCGCT 150 

GGAAATCTGT GACGCGCTGG CGCGTTCAGG CGCGGTGGAC GTCATTGTGG 200 

TCGACTCCGT AGCGGCGCTA ACGCCGAAAG CGGAAATCGA AGGCGAAATT 250 

GGCGACTCTC ACATGGGCCT CGCGGCGCGT ATGATGAGCC AGGCGATGCG 300 

TAAGCTGGCG GGGAACCTAA AACAGTCCAA CACGCTGTTG ATTTTCATCA 350 

ACCAGATCCG TATGAAGATT GGCGTGATGT TCGGTAACCC GGAAACCACC 4 00 

ACC 403 



2) INFORMATION FOR SEQ ID NO: 2209 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhimurium 

(B) STRAIN: ATCC 14028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2209 



2) INFORMATION FOR SEQ ID NO: 2210 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophytics 

(B) STRAIN: ATCC 15305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2210 

ATTAGATCCT GTTTACGCAG AAGCTTTAGG CGTAGATATA CAAAATTTAT- 50 

ATTTATCTCA ACCTGATCAT GGGGAACAAG GTTTAGAAAT TGCCG AAGCA - 100 

TTTGTTAGAA GTGGCGCTGT TGATATCGTT GTGGTCGATT CAGTTGCTGC- 150 

GCTTACACCT AAAGCTGAAA TTGAAGGTGA AATGGGAGAT ACGCACGTTG- 200 

GTTTGCAAGC ACGTCTTATG TCCCAAGCCT TGAGAAAGCT TTCCGGTGCA- 250 

ATTTCAAAAT CAAATACAAC AGCAGTATTT ATCAACCAAA TCCGTGAAAA- 3 00 

AGTTGGTGTG ATGTTCGGTA ATCCTGAAGT TACACCA 337 



2) INFORMATION FOR SEQ ID NO: 2211 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 
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GACCCTGACG CTGCAGGTGA 
GTGCGTTTAT CGATGCGGAA 
CTGGGCGTCG ATATCGATAA 
GCAGGCGCTG GAAATCTGTG 
TCATTGTGGT CGACTCCGTA 
GGCGAAATCG GCGACTCTCA 
GGCGATGCGT AAGCTGGCGG 
TTTTCATCAA CCAGATCCGT 
GAAACCACCA CC 



TTGCCGCTGC GCAGCGTGAA GGTAAAACCT 
CACGCGCTTG ACCCTGTTTA CGCACGCAAG 
CCTGCTCTGC TCTCAGCCGG ATACCGGCGA 
ACGCGCTGGC GCGTTCAGGC GCGGTGGACG 
GCGGCGCTAA CGCCGAAAGC GGAAATCGAA 
CATGGGCCTC GCGGCGCGTA TGATGAGCCA 
GGAACCTGAA ACAGTCCAAC ACGCTGTTGA 
ATGAAGATTG , GCGTGATGTT CGGTAACCCG 



50 
100 
150 
200 
250 
300 
350 
400 
412 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2211 



GACACTGACA TTACAGGTTA 
GTGCATTTAT CGATGCCGAA 
. TTGGGTGTAG ATATTGATAA 
GCAGGCACTG GAAATTTGTG 
TTATCATCGT TGACTCCGTA 
GGTGAAATTG GCGATTCTCA 
GGCTATGCGT AAGCTGGCGG 
TTTTTATCAA CCAAATCCGC 
GAAACCACTA CC 



TCGCCGCCGC ACAGCGTGAA GGCAAAACGT 
CATGCCCTTG ACCCAATCTA TGCCAAGAAA 
CCTACTGTGT TCTCAGCCAG ATACTGGCGA 
ATGCGCTGAC TCGCTCTGGT GCGGTTGACG 
GCGGCATTGA CACCAAAAGC TGAAATTGAA 
TATGGGCCTT GCCGCGCGTA TGATGAGCCA 
GTAACCTGAA GAATGCGAAT ACCTTACTGA 
ATGAAAATTG GCGTGATGTT TGGTAACCCA 



50 
100 
150 
200 
250 
300 
350 
400 
412 



2) INFORMATION FOR SEQ 



ID NO: 2212 



(i) 



SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 404 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2212 

CACGCTCACC TTGCAAACCA TCGCGGAAAT GCAAAAACTG GGCGGCACCT 50 

GCGCGTTTAT CGACGCCGAG CACGCACTGG ACGTCACGTA CGCGCAAAAG 100 

CTGGGCGTCA ACCTGAGCGA TCTGCTGATC TCGCAACCGG ACACCGGCGA 150 

ACAAGCGCTG GAAATCTGCG ACGCCCTGGT GCGTTCCGGT TCGGTGGACA 2 00 

TGGTCGTGAT CGACTCGGTC GCCGCGCTGA CCCCGCGCGC CGAGATCGAA 2 50 

GGCGACATGG GCGATTCGCT GCCAGGTTTG CAGGCACGTT TGATGTCGCA 300 

AGCACTGCGC AAGCTTACCG GTTCGATCAA CCGCACCAAC ACCCTGGTCA 350 

TCTTCATCAA CCAGATCCGC ATGAAAATCG GCGTCATGTT CGGCAGCCCG 4 00 

GAAA 4 04 



2) INFORMATION FOR SEQ ID NO: 2213 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2213 
CGTGCCATTG ACATGATTTC CGAAGAAGAC GCTGAAGGCA CG 
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2) INFORMATION FOR SEQ ID NO: 2214 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2214 

CAACTACATT ACGTTCTGCA ACACAAGGTC GTGGTACTTT CAGTATGACA 50 
TTTGACCACT ATGAAGATGT TCCTAAGAGC ATTGCAGAAG AAATCATCAA 100 
GAAAAATGGC GGTAACGGAG AATAA 125 



2) INFORMATION FOR SEQ ID NO: 2215 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: ATCC 19606 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2215 

CGACTCAAAT GCGTTCTATG TCTCAAGGTC GTGCGACATA CTCAATGGAA 50 
TTTGCTAAAT ATGCTGAAAC TCCACGTAAC GTGGCTGAAG GCATCATCGC 100 
TAAATTCCAA GCTGGCGGTA AAAAAGGTGA CGACGAGTAA 14 0 



2) INFORMATION FOR SEQ ID NO: 2216 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2216 



CCGGTGACCT GCGTTCTAAG ACGCAGGGTC GCGCTGTCTA CTCCATGGAG 
TTCGACAGCT ACGCCGAGGT TCCGCGCGCG GTCGCGGATG AGATCGTCGG 
CAAGTCTCGG GGCAACTGA 



50 
100 
119 



2) INFORMATION FOR SEQ ID NO: 2217 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2217 

CGACCGACGT TCGCTCCATG TCCCAGGGTC GCGCAAGCTA CTCTATGGAA 50 
TTCAAAAAAT ACAACACAGC TCCGGCGCAC ATCGCTGAAA CTGTATCCAA 100 
AAAACAAGGC TGA 113 



2) INFORMATION FOR SEQ ID NO: 2218 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2218 

GCGACCTGCG TTCCCGTACC CAGGGCCGTG CAAACTACAC CATGATCTTC 50 
GACTCCTACG CTGAGGTTCC TACCAACGTG GCAGCTGAGA TCGTGGCAGA 100 
GCGCAACGGC ACTGCCTAA 119 



2) INFORMATION FOR SEQ ID NO: 2219 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 2219 

AACTCAGCTG CGTTCTCTGA CCAAAGGTCG TGCATCATAC ACCATGGAAT 50 
TCCTGAAGTA TGATGATGCG CCTAACAACG TTGCTCAGGC CGTTATTGAA 100 
GCCCGTGGTA AGTAA 115 



2) INFORMATION FOR SEQ ID NO: 2220 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2220 

AACTCAGCTG CGTTCTCTGA CCAAAGGTCG TGCATCATAC ACCATGGAAT 50 
TCCTGAAGTA TGATGATGCG CCGAACAACG TTGCTCAGGC CGTTATTGAA 100 
GCCCGTGGTA AATAA 115 



2) INFORMATION FOR SEQ ID NO: 2221 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2221 



CAACTCACCT TCGTTCAGGT ACGCAAGGTC GTGGTGTATA CACTATGCAA 
TTTGACCACT ATGAAGAAGT TCCTAAATCT ATTGCTGAAG AAATCATTAA 
AGCTAATGGT GGA 



50 
100 
113 



2) INFORMATION FOR SEQ ID NO: 2222 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 

(B) STRAIN: ATCC 25291 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2222 

CGGCGACCTG CGGTCCAAGA CCCAAGGCCG GGCGAACTAC TCCATGGTCT 50 
TCGACTCCTA CGCCGAAGTG CCGGCCAACG TGTCGAAGGA GATCATCGCG 100 
AAGGCGACGG GTCAGTGA 118 

2) INFORMATION FOR SEQ ID NO: 2223 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: Mgor-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2223 

CCGGCGACCT GCGGTCCAAG ACGCAAGGCC GGGCGAACTA CTCCATGGTG 50 
TTCGACTCGT ACGCCGAAGT TCCGGCGAAC GTGTCGAAGG AGATCATCGC 100 
GAAGGCGACG GGCGAATAG 119 



2) INFORMATION FOR SEQ ID NO: 2224 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium kansasii 

(B) STRAIN: Mkan-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2224 

CGGCGACCTG CGGTCCAAGA CTCAAGGCCG GGCGAACTAC TCGATGGTGT 50 
TCGATTCCTA CGCCGAAGTG CCGGCTCAGG TGTCGAAGGA GATCATCGCG 100 
AAGGCGACTG GCGAGTGA 118 



2) INFORMATION FOR SEQ ID NO: 2225 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium terrae 

(B) STRAIN: Mter-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2225 

CGGAGACTTG CGGTCGAAGA CCCAGGGCCG GGCGAACTAC TCCATGGTGT , 50 
TCGACTCCTA CGCCGAAGTG CCGGCGCAGG TGGCGAAGGA GATTATCGCG 100 
AAGGCAACGG GCGAGTAA 118 



2) INFORMATION FOR SEQ ID NO: 2226 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2226 

GACCGACCTG CGTTCTGCAA CCCAAGGCCG CGCTACTTAC TCTATGGAGT 50 
TCAAGAAATA TTCTGAAGCT CCTGCCCACA TAGCTGCTGC TGTAACTGAA 100 
GCCCGTAAAG GCTAA 115 
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2) INFORMATION FOR SEQ ID NO: 2227 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2227 

AACTTCATTA CGTTCTAACA CGCAAGGTCG CGGTACTTAC ACAATGTACT 50 
TTGACCACTA TGCAGAAGTT CCTAAATCAA TTGCTGAAGA AATCATCAAG 100 
AAAAATAAAG GTGAATAA 118 



2) INFORMATION FOR SEQ ID NO: 2228 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2228 

AACTTCATTA CGTTCTAACA CTCAAGGTCG CGGTACTTAC ACTATGTACT 50 
TCGATCACTA TGCAGAAGTT CCAAAATCAA TTGCTGATGA TATCATCAAA 100 
AAAAATAAAG GTGAATAA 118 



2) INFORMATION FOR SEQ ID NO: 2229 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1630 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succinivibrio dextrinosolvens 

(B) STRAIN: ATCC 19716 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2229 

CGTTCAATGC GTGTTCTAGA CGGTGCAGTT ATGGTTTACT GTGCTGTGGG 50 

TGGTGTTCAG CCTCAGTCTG AAACCGTTTG GAGACAGGCT CAGAAGTACA 100 

AGGTTCCTCG TATTGCTTTC GTTAATAAGA TGGACCGTAC TGGTGCTAAT 150 

TTCCTACGTG TTGTAGAGCA GATTAAGACC CGTCTAAAGG GTAACCCTGT 2 00 

TCCTCTAATG TTACCTATCG GTAAAGAGGA CAGCTTTGTT GGTGTAGTTG 250 

ACCTAATCAA^GCGTAAGGCT ATCGACTGGG ATGAGGCATC TCAGGGTATG 300 

AAGTTTGAGT ACGTTGACAT TCCAGCAGAT ATGGTTGAGG AAGTTGAAGA 3 50 

GTGGCGTGCA AAGCTTGTAG AAGCAGCTGC, AGAAGCTAAC GACGAGCTGA 400 

TGGATAAATT CTTCGGTGGT GAAGAGCTGA CCGAGGAAGA GATCAAGGCT 450 

GCTCTACGTG AGCGTACTCT TCGCAACGAA ATTATTCCTA TGTGCTGCGG 500 

TTCAGCATTT AAGAACAAGG GTGTTCAGGC AATGCTTGAC GCTGTTGTTG 550 

AGTATCTTCC ATCTCCAGCA GATGTTCCTG CTGTTGAGGG TAAGACCCTA 600 

ACCGGTGAAG CTGATACTCG TAAGGCTGAC GATAAAGAGC CATTCTCTGC 650 

TTTAGCATTT AAGCTAGCAA ATGACCCATT CGTAGGTAAC TTAACATTCT 700 

TACGTTGCTA CTCAGGCTTT ATTAAGTCTG GTGACACTGT AATGAACTCA 750 

GATAAGCAGA AGCGTGAGCG TTTCGGCCGT CTAGTTCAGA TGCACGCTAA 800 

TGCTCGTAAT GAGGTTAGCG AGGTTTATGC AGGTGACATC GTTGCTGCTA 850 

TTGGTCTGAA GGAAACCGTT ACCGGTGATA CCTTATGTGA CCCAGAGCAT 900 

CCAATCATTC TTGAGTCAAT CGACTTTGCA GAGCCAGTTA TCTCTGTAGC 950 

AGTTGAGCCT AAGACCAAGG ACGATCAGGA GAAGATGGCT CTTGCTTTAC 1000 

AGCGTTTAGC AAAAGAAGAT CCTTCATTCC GCGTTCGTAC AGACGAAGAG 1050 

TCTGGCCAGA CCATTATTTC TGGTATGGGT GAGCTTCACC TAGACATCAT 1100 

TGTTGACCGT CTACGCCGTG AGTTCAAGGT TGAGTGTAAT CAGGGTAAGC 1150 

CACAGGTTGC ATACCGTGAG ACCATTAAGA GCAAGGTTGA ACAGCAAGGT 12 00 

AAGTTTGCTC GTCAGTCTGG TGGTCGTGGT CAGTACGGTG ACTGCTGGTT 1250 

ACGTATGGAA CCTCTTGAGC CAGGTAAGGG CTACGAATTC GTGAATGAGA 13 00 

TTGTTGGTGG TGTAATTCCT AAGGAATATA TCCCTGCAAT TGATAAGGGC 135 0 

TGTCAGGAGC AGATCGCTAA CGGTGTTCTA GCTGGTTTCC CAGTTGTTGA 1400 

CATCAAGATC ACTGTATTCG ATGGTTCTTA CCACGAAGTT GACTCTTCAG 1450 

AAATGGCATT CAAGATTGCT GCTTCTATGG CATTCAAAGA GGGCTTCAAG 1500 

AAGGCAAATC CTGTTCTTCT AGAGCCTTTA ATGAAGGTAG AAGTTGATAC 1550 

TCCTGAAGAC TACATGGGTG ACGTTATTGG TGACTTAAAC CGTCGTCGTG 1600 

CTATCGTTGA AGGCATGGAA GATGGTCCTA 1630 



2) INFORMATION FOR SEQ ID NO: 2230 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1662 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tetragenococcus halophilus 

(B) STRAIN: ATCC 33315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2230 
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TTGAAGTGCA ACGTTCATTG CGTGTGCTAG 
GACTCACAAT CAGGTGTTGA ACCACAAACT 
AACAGAATAT CAAGTACCTC GTATTGTATT 
TTGGCGCAGA TTTCTTATAT TCTGTCCGAT 
GCTAATGCAC AACCTATCCA ATTGCCAATT 
AGGTATCATC GACCTTGTGA AGATGAAAGC 
TAGGGACTAC TTTCGAAGAA ACTGAAATCC 
GCTCAAGAAT GGCATAATAA TTTGGTAGAA 
AGATATCATG ATGAAATACT TGGAAGGTGA 
TACAAGCAGG TATTCGTAAA GCAACATTAT 
TTATGTGGTT CTGCATTTAA AAACAAAGGT 
AGTAATTGAT TACTTGCCTT CTCCAACCGA 
TCGATCCGAA AACAGATGAA GAAACTGAAC 
CCTTTTTCAT CACTTGCTTT TAAAGTTATG 
CTTAACTTTC TTCCGTGTTT ATTCAGGTGT 
TATTGAATGC TACTAAGGGT TCACGTGAAC 
ATGCATGCCA ATTCTCGTTC TGAGATCGAT 
TGCAGCTGCT GTAGGCTTGA AGAACACTAC 
ATGAGAAAAA TCCAGTTATT TTGGAAACTA 
ATTCAAGTTG CTGTTGAACC TAAGTCAAAA 
CGTAGCACTA CAAAAACTTG CAGAAGAAGA 
CCAACGCTGA AACTGGCGAA ACTGTAATTG 
TTAGACGTTC TTATTGACCG TATGAAGACT 
TATTGGTGCA CCACAAGTTT CTTATCGTGA 
AAGCTGAAGG GAAATTTATC CGCCAATCTG 
CACGTATGGG TTGAATTTAC TCCAAACGAA 
TAAAAACTCC ATTGTTGGTG GGGTTGTCCC 
TACAAAAAGG ACTGGAAGAC GCCATGGAAA 
CCATTAGTTG ACGTAAAGGC AGAACTGTTT 
CGACTCTAAT GAAACAGCCT TCCGTATTGC 
AAGCTGCGAA AAAGGCAGAT CCAGTTATTC 
ACAATTAGTA TCCCTGAAGA ATATCTAGGT 
AGCTCGTCGT GGTCGTGTTG AAGGAATGGA 
CTGTAAATGC GT 



ACGGTTCAGT TACTGTCTTG 50 

GAAACAGTTT GGCGTCAAGC 100 

CTGTAATAAA ATGGATAAGG 150 

CATTGCATGA TCGTTTAGAA 200 

GGTGCTGAAG ATAACTTTGA 250 

TGAATTTTAT AAAGATGATT 300 

CAGATGAATA TAAAGAAACA 350 

TCTGTAGCTG ATTTTGATGA 400 

AGAAATTACA CCTGAAGAGT 450 

CTGTTGAATT TTACCCAGTA 500 

GTTCAAATGA TGTTGGATGC 550 

CGTTCCCCCA ATTAAAGGGA 600 

ATCCTGCTGA " TGATAGTGAG 650 

TCAGACCCTT ATGTTGGCCG 700 

GTTGGATACA GGTTCTTATG 750 

GAATTGGTCG TATTTTGCAA 800 

AAGGTTTATT CAGGTGACAT 850 

AACAGGGGAT ACCCTTTGTG 900 

TCAACTTCCC TGAACCAGTA 950 

GCTGACCAAG ATAAAATGAG 1000 

CCCATCTTTT AAAGTGGAAA 1050 

CTGGTATGGG TGAACTTCAA 1100 

GAATTTAAAG TGGATGCCAA 1150 

AACTTTCCGT TCATCAACTA 1200 

GTGGTAGAGG TCAATACGGT 1250 

GAAGGAGCAG GATTCGAATT 1300 

TCGTGACTAT ATACCTGCAG 1350 

ATGGTGTGTT AGCTGGTTAT 1400 

GATGGTTCTT ACCATGACGT 1450 

GGCTTCAATG TCTCTACGTG 1500 

TTGAACCGAT GATGAAAGTA 1550 

GATATTATGG GACATGTTAC 1600 

TGCTCACGGT AATGCACAAA 1650 

1662 



2) INFORMATION FOR SEQ ID NO: 2231 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1652 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Veillonella parvula 

(B) STRAIN: ATCC 10790 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2231 

AAGTAGAACG TTCTCTACGT GTACTTGACG GTTCTGTTGC GGTGTTCAGT 50 

GCTAAAGGTG GCGTTGAACC TCAATCCGAA ACAGTATGGC GTCAGGCTTC 100 

TAACTACGGC GTACCTCGTA TCGCTTATGT AAATAAGATG GATACTGTAG 150 
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GTGCTGACTT CTTCAACGTA GTTGACATGA 
AATTCCGTAG CTATCCAAGT ACCAATCGGT 
CATCATTGAC TTGATGACTA TGAAAGCGGA 
GTAAAGAATA TGAAATCACT GATATCCCTG 
GAAGCTCGTC GCGAAATGAT GATCGATGCT 
TATCATGATG AAATATTTGG AAGGCGAAGA 
AAGCGGCATT GCGTAAAGCT GTTATTGCTA 
TGTGGTTCTT CCTATAAAAA TAAAGGTGTT 
TATCGATTAC ATGCCAGCTC CAATCGACAT 
TTCCTGGTAC TGAAGAAGAA ACAACTCGTC 
TTCTCTGCAT TGGCATTCAA AATCATGGCT 
AGCGTTCTTC CGTGTGTACT CCGGTACATT 
TCAACTCCAC TAAAGGTAAA AAAGAACGTA 
CACGCTAACT CCCGTAAAGA AATCGAACGC 
TGCGGCGGTT GGCTTAAAGG ATACTACTAC 
AAAAATCTCC TGTAATCCTT GAGTCCATGG 
TCCGTTGCTG TTGAACCTAA AACAAAAGCT 
AGCTCTTGCT CGTTTGGCAG AAGAAGATCC 
ATGAAGAAAC AGGTCAAACT ATTATCTCTG 
GATATCATCG TTGACCGTAT GAACCGTGAA 
AGGTAAACCT CAAGTAGCAT ACCGCGAAAC 
CTGAAGGTAA ATTCGTACGT CAATCTGGTG 
TGCTGGTTGG AATTGATTCC TCAAGAACCA 
AAACAAGGTT GTAGGTGGTG CGATTCCTCG 
AAAGCGGTGT TAAAGAAGCT ATGGAATCCG 
ATGGTTGATG TTAAAGTTAT CGTATTTGAT 
CTCCAACGAA ATGGCCTTCA AAATTGCTGG 
GTGCTCGCAA AGCAGACCCT GCATTGCTTG 
GTAGACGTTC CTGAAGAATA CATGGGCGAC 
TCGTCGTGGT CGCATGGACG GCATGGAAGC 
TC 



PCT/CAOO/01150 

TGAAAGCTCG TTTGGGTGCA 200 

GCTGAAGATA CTTTCGAAGG 250 

AATTTATAAA TCCGATGACG 300 

CTGAATATCA AGAAGTAGCA 350 

ATCGCTGAAA CAGATGATGA 400 

AATTTCTGTG GAAGAATTGA 450 

ACCAATTATT CCCAGTTCTT 500 

CAAATGTTAT TGGATGCTGT 550 

CCCACCTATT AAAGGTGTTG 600 

CTTCTTCCGA TGAAGAGCCA 650 

GACCCTTATG TTGGTAAATT 700 

GGAATCTGGC TCCTACGTTT 750 

TCGGTCGTAT TCTTCAAATG 800 

GTATATTCTG GTGACATCGC 850 

AGGCGACACA TTGTGTGATG 900 

AATTCCCTGA ACCAGTTATC 950 

GACCAAGAAA AAATGGGTAC 1000 

TACTTTCAAA GTTCGTACTG 1050 

GTATGGGCGA ACTTCACTTG 1100 

TTCAAAGTAG ATTGTAACGT 1150 

TATCCGTAAA GCTGTTAAGG 1200 

GTCGTGGTCA ATATGGTCAC 1250 

GGTGCTGGCT TCGAGTTTGA 13 00 

TGAATACATC GGACCTGTTG 1350 

GTGTTATCGC TGGGTACCCT 14 00 

GGTTCTTACC ATGACGTTGA 1450 

TTCTATGGGC TTCAAAGAAG 1500 

AACCATATAT GGCTGTAGAA 1550 

GTTATCGGTG ACTTGAACTC 1600 

TCGTAATGGT TCCCAACATA 1650 

1652 



2) INFORMATION FOR SEQ ID NO: 2232 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1624 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2232 

AGTAGAGCGT TCCATGCGTG TTCTTGACGG CGCGGTAATG GTTTACTGTG 50 

CAGTTGGTGG TGTTCAGCCA CAGTCTGAAA CCGTATGGCG TCAGGCTAAT 100 

AAATATAAAG TTCCACGTAT TGCGTTCGTT AACAAAATGG ACCGTATGGG 150 

TGCGAACTTC CTGCGCGTAG TTGGTCAACT GAAATCTCGC CTTGGTGCGA 200 

ACCCAGTTCC ACTGCAGTTG GCAATTGGCG CAGAAGAAAA ATTCACCGGT 250 

ATTATCGATC TGGTGAAAAT GAAAGCGATC AACTGGAACG AAGCTGATCA 300 

GGGCGTGACC TTCGAATATG AAGAAATCCC TGCTGATATG GCTGAACTGG 350 
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CTGCTGAATG GCACCAGAAT CTGGTTGAAT 
GAGCTGATGG ACAAATACTT GGGTGGCGAA 
CAAGAAAGCT TTACGTCAAC GTGTTCTGAA 
CCTGTGGTTC TGCGTTTAAA AACAAAGGCG 
GTTATTGAGT ACCTGCCTGC ACCAACTGAC 
CTTGGATGAT GGCAAAGATA CTCCGGCTGT 
AGCCGTTCTC TGCTCTGGCG TTCAAAATCG 
AACCTGACGT TCTTCCGCGT GTACTCTGGT 
CGTTCTGAAC TCAGTGAAAT CGCAACGTGA 
AGATGCACGC TAACAAGCGT GAAGAGATCA 
ATCGCAGCCG CTATCGGTCT GAAAGATGTG 
TGACCCGAAT AATCCGATCA TCTTGGAACG 
TAATCTCTGT TGCTGTTGAA CCAAAAACCA 
GGTATGGCTC TGGGGCG TTT GGCGAAAGAA, 
GACTGACGAA GAATCTGGTC AGACTATCAT 
ATTTGGATAT CCTGGTTGAC CGTATGCGCC 
AACGTCGGTA AACCTCAGGT TGCGTACCGT 
TAAGGATGTG GAAGGTAAGC ACGCTAAGCA 
ATGGTCATGT TGTTATCGAC ATGTCTCCAT 
TATGAATTCG TCAACGAAAT CGTTGGTGGT 
TCCGGCCGTT GATAAAGGTA TTCAAGAACA 
CAGGTTACCC AGTTGTTGAC GTTAAAGTGC 
CATGACGTTG ACTCCTCAGA ATTGGCATTT 
CTTTAAAGAA GGTTTCAAAC GAGCTAAACC 
TGAAGGTTGA AGTCGAAACC CCTGAAGATT 
GACCTGAACC GTCGTCGCGG TATC 
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CTGCGGCAGA AGCGTCTGAC 4 00 

GAGCTGACCG AAGAAGAAAT 4 50 

AAGCGAAATT ATTCTTGTTA 500 

TACAGGCAAT GCTGGATGCG 550 

GTTGAATCAA TCAACGGCAT 600 

TCGTCATTCT GACGACAAAG 650 

CTACCGACCC ATTCGTGGGT 700 

ATTGTTAATT CCGGTGATAC 750 

ACGCTTAGGT CGTATCGTAC 800 

AAGAAGTTCA CGCCGGTGAT 850 

ACTACGGGTG ACACTTTGTG 900 

TATGGAGTTC CCAGAGCCGG 950 

AAGCTGACCA AGAAAAAATG 1000 

GATCCATCAT TCCGCGTTTG 1050 

CGCTGGTATG GGTGAGTTGC 1100 

GCGAATTTAA CGTGGAAGCA 1150 

GAAACTATCC GCGAAACCGT 1200 

GTCAGGCGGT CGTGGTCAGT 1250 

TGCCACCGGG TGGTGTTGGG 13 00 

TCTATTCCTA AAGAATTCAT 1350 

GCTGAAATCT GGCCCTCTGG 14 00 

GTCTGCACTA CGGTTCTTAC 14 50 

AAATTAGCTG GTTCTATCGC 1500 

AGTTCTGCTT GAGCCAATCA 1550 

ACATGGGTGA CGTAATGGGC 1600 

1624 



2) INFORMATION FOR SEQ ID NO: 2233 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1636 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2233 

GAACGCTCGA TGCGCGTGCT TGACGGTGCT TGCATGGTTT ACTGCGCAGT 50 

GGGCGGTGTT CAGCCACAGT CGGAAACCGT GTGGCGTCAG GCTAACAAGT 100 

ACAAAGTGCC ACGTCTGGCC TTCGTCAACA AGATGGACCG TACCGGCGCC 150 

AACTTCTTCA AGGTGTACGA GCAGATGCGT GCTCGCCTGA AGGCCAACCC 200 

GGTCCTGATC CAGATCCCTA TCGGCGCTGA AGACAACTTC AAAGGCGTGA 250 

TCGATCTGGT CAAGATGAAG GCTATCCTGT GGGACGAAGC GTCGCAAGGC 300 

ATGAAATTCG ACTACGTCGA TATTCCTGCA GAGCTGGCTG ATTCGGCCGC 350 

CGAGTGGCGC GAAAAGATGG TTGAAGCTGC TGCTGAAGCC ACCGAAGAGC 4 00 

TGATGAACAA GTACCTGGAA GAAGGCGACC TGACCGAAGC CGAGATCAAG 4 50 

CAGGCGCTGC GTACCCGTAC CATCGCTTCG GAAATCGTTC CGATGATGTG 500 

CGGTACCGCC TTCAAGAACA AGGGCGTACA GGCCATGCTG GACGCGGTCA 550 

TCGAATACCT GCCATCGCCA CTGGACATCG ACGATGTCGG CGGTACGGAC 600 
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GAAGACGACC AGCCAACCAC CCGTCGCGCA 
GGCGCTGGCC TTCAAGATCA TGACCGACCC 
TCTTCCGCGT GTACTCGGGC GCCGTCAATT 
TCGGTCAAAG GTCGTAAAGA GCGTCTGGGC 
GAATCAGCGC GAAGAGATCA AAGAAGTGCG 
CGGTTGGCCT GAAAGACGTG ACCACGGGCG 
GCCATCATCA CGCTGGAAAA AATGATCTTC 
GGCAGTCGAG CCAAAAACCA AGGCCGACCA 
TGAACCGCCT GGCACAGGAA GATCCTTCGT 
GAATCGGGCC AGACCATCAT CGGTGGWATG 
TATCGTTGAC CGCATGAAGC GCGAATTCGG 
AGCCACAAGT GGCTTACCGC GAAACGATCC 
GAAGGCAAGT TCGTCAAGCA ATCCGGTGGT 
TGTGCTGAAG ATCGAGCCGC AAGAACCAGG 
ACGCCATCAA GGGCGGTACC GTTCCTCGCG 
AAAGGCGTGC GCGGCACCCT GAACACCGGC 
CGTGGACGTC AAGGTCACGG TGTTCTTCGG 
CGAACGAAAA CGCGTTCCAG ATGGCCGCTT 
TGCCGCAAAG CATCGCCAGT CATTCTGGAG 
GGAAACGCCG GAAGACTACG CCGGTACCGT 
GCCGCGGTAT GGTGCAGGGC ATGGACGAAA 



GCTGACGACG AGAAATTCTC 650 

GTTCGTCGGT CAATTGGCCT 700 

CGGGCGACAC CGTGTACAAC 750 

CGTATTCTGC AGATGCACGC 800 

CGCCGGCGAC ATCGCCGCTG 850 

AAACCCTGTG CGATCCGACC 900 

CCTGAGCCTG TGATTCAACA 950 

GGAAAAAATG GGCCTGGCAC 1000 

TCCGCGTGAA GACCGATGAA 1050 

GGCGAGCTGC ACCTGGAAAT 1100 

CGTGGAAGCA ACCGTCGGCA 1150 

GTAAAACCTG CGAAGAATCG 1200 

CGTGGTCAAT ACGGTCACGT 1250 

CAAGGGCTTC GAGTTCGTTG 1300 

AGTACATCCC TGCGGTGGAA 1350 

GTGCTGGCTG GTTACCCGGT 1400 

TTCGTACCAC GATGTGGACT 1450 

CGATGGCATT CAAAGAAGGC 1500 

CCAATGATGG CTGTGGAAGT 1550 

GATGGGCGAC CTGTCGTCCC 1600 

TCCCAG 1636 



2) INFORMATION FOR SEQ ID NO: 2234 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aeromonas hydrophila 

(B) STRAIN: ATCC 7966 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2234 

GGCACGAACG ATCTTGCCGG ACGGGCCATC TTCCATACCT TCGATCAGGC 50 

CACGACGACG GTTCAGGTCA CCGATCACGT CGCCCATATA GTCTTCCGGA 100 

GTCTCGACTT CTACTTTCAT GATCGGTTCG AGCAGAACCG GGTTGGCCTT 150 

CATGAAGCCA GCCTTAAAGG CCATGGAAGC AGCGATCTTG AACGCCAGTT 200 

CGGAAGAGTC GACATCGTGG TAGGAACCGA AGTGCAGACG CACGCCCAGA 250 

TCCATAACCG GATAACCTGC CAGCGGGCCG GACTTCAGTT GCTCGCGGAT 300 

ACCCTTGTCA ACACCCGGGA TGAACTCACC AGGAATGACG CCGCCCTTGA 350 

TGTCGTTGAC GAATTCGTAG GCTTTGCCTT CTTCCAGCGG GTACATGTCG 4 00 

ATCACAACGT GACCGTACTG ACCACGACCA CCGGACTGCT TGGCGTGCTT 450 

ACCTTCGATA TCCTTGACGG TGTTACGAAT GGTTTCACGG TAGGCAACCT 500 

GCGGCTTACC TACGTTCGCT TCTACCTTGA ACTCGCGACG CATACGGTCA 550 

ACGATGATGT CCAGGTGCAG CTCACCCATA CCGGCGATGA TGGTTTGGCC 600 

AGACTCTTCG TCAGTCCATA CGCGGAAGGA CGGGTCTTCC TGAGCCAGAC 650 

GGCCCAGAGC CAGGCCCATC TTCTCTTGGT CAGCCTTGGT TTTCGGCTCA 700 

ACTGCGATGG AGATTACCGG TTCCGGGAAT TCCATACGCT CGAGGATGAT 750 

CGGCGCTTTT TCGTCACACA GGGTGTCACC GGTGGTCACG TCTTTCAGAC 800 

CAATGGCGGC AGCGATGTCG CCTGCGCGAA CTTCTTTGAT CTCTTCACGC 850 
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TTGTTGGCGT GCATCTGAAC 
AACGGAGTTC AGCACGGAGT 
GGAAGAAGGT CAGGTTGCCT 
AGAGCAGCAA ACGGCTCGTC 
GCCATCCAGC TTCAGACCGT 
GATAGTCAAC CACGGCATCC 
GCGGAGCCAC AGGTAACCAG 
AAGAGCTTTC TTGATCTCTT 
TTTCCATCAG GTCTTCAGAC 
CGCATTTCTT GCGCTTTTTC 
GAAGGATACG CCCTGGTCAG 
CCAGGTCGAT AACGCCCTTG 
AGCGGAACCG GGTTACCTTT 
CAGGAAGTTG GCACCGGTAC 
GAACCTTGTA CTTGTTAGCC 
ACGCCACHTA CGGCACAGTA 
GGAACG 



GATACGGCCA AAACGCTCGC 
CACCGGAGTT AACCACACCG 
ACGAACGGGT CGGTAGCAAT 
ATCAGAAGCA TGACGCTCGT 
CGATGGCTGC TACGTCGGTC 
AGCATGGCCT GTACGCCCTT 
GATGATTTCG TTGTTCAGAA 
CCTCGGTCAG TTCTTCACCA 
GCTTCAGCAG CGGCTTCAAC 
CAGCAGCTCA GCCGGGACGT 
CTTCGCTCCA GTTGATGGCT 
AAGTTCTCTT CTGAACCGAT 
CAGACGGGTC TTGATGTGCT 
GGTCCATCTT, GTTGACGAAC 
TGACGCCATA CGGTTTCAGA 
CACCATCACG GCGCCGTCCA 



GCTTCTCTTT 900 

GAGTAAACGC 950 

CTTGAATGCC 1000 

CTTTGGTCTC 1050 

GGCGCCGGCA 1100 

GTTCTTGAAC 1150 

CACGCTGACG 1200 

CCCAGGTATT 1250 

CAGGGTCATG 1300 

CTTCGTAATC 1350 

TTCATCTTGA 14 00 

GTTCAGTTGC 14 50 

CAACGCAGCG 1500 

GCGATACGGG 1550 

CTGTGGCTGT 1600 

GAACACGCAT 1650 
1656 



2) INFORMATION FOR SEQ ID NO: 2235 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia 

(B) STRAIN: ATCC 49175 

(xi) SEQUENCE DESCRIPTION: SEQ 

TCTCCTCCCC ATTTGATAAC TACCAAATGA 
CAGTCATAAC CAGTTGATTT TTAAGAGAGT 
GGTAGATACT GTTATAGAAT CTAACAAAAC 
TTTAA 



adiacens 



ID NO: 2235 

ACGCTATCGA CTGGTTATGT 50 
TCTTTGGTAT AATTACAATC 100 
TCAATTAATA GGAGGAATCA 150 

155 



2) INFORMATION FOR SEQ ID NO: 2236 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: ATCC 19606 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2236 

TCTTTCGATT ACTATAAGCC CTAACTAATT CTTAGTTAAA AACCAAGTGC 50 
TCATGGAGCG ACCCTCCATG AGTAGTTAAT AAAGGAAGAT CATC 94 



2) INFORMATION FOR SEQ ID NO: 2237 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

MOLECULE TYPE: Genomic DNA 
ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2237 

GTCCCACCGG GGGTGAGTGG GGGCCGGGCA ATCGGCCCCT GCGAGCCGCC 50 
TCACACACCA GTAATCCCAG TAGGTCTCAT GCCCCATGGG TGTTAAAGTG 100 
ACACCTAGCC GTAGGCTGAG AATTTCTACC CGAGTCCAGG AGGACGAAAA 150 



(ii) 
(vi) 



2) INFORMATION FOR SEQ ID NO: 2238 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

■ (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2238 
TTCAGTCCTT TAGGCAAGGA GTTAATTGTC 30 



2) INFORMATION FOR SEQ ID NO: 2239 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) " TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2239 

CAACTAGCCT CTCGCTTATC TCCCTATGCA AAATTTCTAG TCAGTTACAG 50 

GGCATAGGGG GAGCGTAGGC GGGGGTAGCG GCTTGCTGAG CACTTCCTCT 100 

ACATCAAAGG GAATGTTGAG CCGGCCGTTA CCCTGTACGA TCCCATCTGG 150 

TTTCTTCGGT GGTTTGATAA ATACCCCGTT GTGACCCTAG GATCATGTAA 200 

CTGGCACAAT GTAAATAGCT GTACTGCCAG GCTGCCGAAT TAGCAGTCAG 250 

AAATGTACAG CACTGTCAAC TCGTGGCTGC GAAATCGTAG CCACCACGAA 3 00 

GTCCAGGAGG ACACACA ' 317 



2) INFORMATION FOR SEQ ID NO: 2240 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2240 

TCCACAGGAT TAAAACCTAA GTCCCGTGCT CTCTCCTGAG GGGAGAGCAC 50 
TATAGTAAGG AATATAGCC 69 



2) INFORMATION FOR SEQ ID NO: 2241 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2241 

TCCACAGGAT TAAAACCTAA GTCCCGTGCT CTCTCCGAAG GGGAGAGCAC 50 
TATAGTAAGG AATATAGCC 69 
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2) INFORMATION FOR SEQ ID NO: 2242 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2242 

AACAACAAAG AAGATTAATT GATTTTTTTC GCAACATCAA GTATAACTTT 50 
AGTTAGAAGT ATTACTTAGT TTAAATTTAA GCTAAGTAAA AAATAATTAT 100 
CGAATTATCG AGGAGGATAT TTTAAA 126 



2) INFORMATION FOR SEQ ID NO: 2243 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 

(B) STRAIN: ATCC 25291 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 3 

GTGTCAACTC ACTGGCTCGG AGCCGAGCAA TCGGCTCAGC GAAGGCGACG 50 

GGTCAGTAGC TACTGGCAGC GGAGTAATCT TGCCGGGTCA TTGGAATGCC 100 

TTGGGCGCGG CACAACTGAA AACACCAACA CTGCTTTAAC AAGCACCAAC 150 

TAGTCCAGGA GGACACAGAA 170 



2) INFORMATION FOR SEQ ID NO: 2244 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: Mgor-1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2244 



GGCCGGCAAG CCTGCGAGTA AGCTGACGCG GTTAGCACCG CGGCAAAACC 
AAGAAAAATC AACACTGCTT TTTTAAGCAC CAACAGTCCA GGAGGACAAC 
AAA 



50 
100 
103 



2) INFORMATION FOR SEQ ID NO: 2245 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium kansasii 

(B) STRAIN: Mkan-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2245 

GCCGAGCGTT GCGCGTAAGC TAGCTCGGTT ACCACGGCGG CAAAACTAGA 50 
AAAACATCAA CACTGCTTTT ATAAGCACCA ACAGTCCAGG AGGACACAGA 100 
A 101 



2) INFORMATION FOR SEQ ID NO: 2246 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium terrae 

(B) STRAIN: Mter-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2246 

CCAGCTTCCC GCCCGTGCCG CTACGGTGGC ACAAACCCAA AAAGATCAAT 50 
CCTGCTGTAA CCCAGCACCA ACAAGTCCAG GAGGACAAGA A 91 



2) INFORMATION FOR SEQ ID NO: 2247 

1154 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2247 

TCAGGCAAAT AGGCCGTCTG AAAGGCTGAA ATGATTTTTC AGACGGCATT 50 
GTTCTTTAAT CGATCTTTAA. TGTAAAGGAA TTAGCTC 87 



2) INFORMATION FOR SEQ ID NO: 2248 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2248 

TATAACTTGT TAAGACTAGC TATGCTAGGT TAAAATACAG GTTGAGCTTA 50 

TTTATAAGCT GACATTTTTA TGATTTGATT TTTAGGGGTA AATGCATTAT 100 

AAAAGAATTA TAAATTCTTT TATGCTACAC TCAATCAATT TTCTTCTCAT 150 

GATGGTGAGA AACTATCATG AGAGATAAAT TTGAAATAAC TTTTATTAAG 200 

AATAGGAGAG ATTTAATA 218 



2) INFORMATION FOR SEQ ID NO: 2249 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 9 



TTAAACTTGT TTTAGCTAGA ATTTCTAGGT AAAATACAGC GTAAGCTTAT 
TAATTAAGCT AACATCTTTA TGAATTGATT TTTTACTGAA AATGCATTAT 
AAATGAATTA TGAATTCTAA CAATCATTAT GTCTCATGAT GGTGAGAAAC 
TATCATGAGA GATAATATTG AAATAACTTT TACTAGAATA GGAGAGATTT 
AATA 



50 
100 
150 
200 
204 



2) INFORMATION FOR SEQ ID NO: 2250 

(i) SEQUENCE CHARACTERISTICS : , 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2250 
CCGTCGGATG GTGTCGTATA CCGCGGAGTC GCCGACGG 38 



2) INFORMATION FOR SEQ ID NO: 2251 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2251 
CGGAGCCGTT CTCGCTGCGT TACATGCTGG TGGCTCCG 38 



2) INFORMATION FOR SEQ ID NO: 2252 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2252 
CCACATACAG TGTCTCTC 18 
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2) INFORMATION FOR SEQ ID NO: 2253 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2253 
CATTACCCAA CCGAAAGTA 19 



2) INFORMATION FOR SEQ ID NO: 2254 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2254 
ACCTGAACAG AGAGAAATG 19 



2) INFORMATION FOR SEQ ID NO: 2255 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2255 

ATGGCAAAAG AAAAATTTGA CCGTTCAAAA CCACACGTTA ACATTGGTAC 50 

AATCGGCCAC GTTGACCACG GTAAAACAAC ATTAACTGCT GCTATCACAA 100 

CTGTTTTAGC TAAGAAAGGT TTCGCGCAAG CTCAAGATTA CGGTTCAATC 150 

GATAAAGCTC CAGAAGAACG CGAACGTGGT ATCACAATCA ACACTTCTCA 200 

CGTTGAGTAC GAAACAGACA CTCGTCACTA TGCTCACGTT GACTGCCCAG 250 

GACACGCGGA CTACGTTAAA AAC 273 
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2) INFORMATION FOR SEQ ID NO: 2256 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: ATCC 19606 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2256 

ATGGCTAAAG CCAAGTTTGA ACGTAATAAA CCACACGTAA ACGTGGGTAC 50 

AATCGGTCAC GTTGACCATG GTAAAACAAC TTTAACTGCT GCGATTGCAA 100 

CAATTTGTGC AAAAACTTAC GGCGGTGAAG CGAAAGATTA CTCACAAATC 150 

GACTCAGCAC CTGAAGAAAA AGCACGTGGT ATTACAATTA ATACATCACA 200 

CGTAGAATAC GATTCTCCAA CTCGTCACTA CGCACACGTT GACTGCCCAG 250 

GCCACGCCGA CTACGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2257 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces 

(B) STRAIN: ATCC 35568 

(xi) SEQUENCE DESCRIPTION: SEQ 

GTGGCGAAGG CCAAGTTTGA GCGCACCAAG 
GATTGGTCAC GTTGACCACG GCAAGACGAC 
AGGTGCTGCA TGACAAGTAC CCCGAACTGA 
CAGGTCGACA ACGCTCCCGA GGAGCGCGAT 
CTCTCACGTT GAGTACCAGA CCGAGGCGCG 
CTCCCGGCCA CGCCGACTAC GTCAAGAAC 



meyeri 



ID NO: 2257 

CCGCACGTCA ACATCGGCAC 50 

GCTGACGGCA GCTATCACCA 100 

ACGAGTTCAC CCCCTTCGAT 150 

CGTGGCATCA CGATCAACGT 200 

TCACTACGCG CACGTTGACG 250 

279 



2) INFORMATION FOR SEQ ID NO: 2258 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 273 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ 

GTGGCTAAAG AAAAATTTGA TCGTTCCCTA 
TATCGGTCAC GTTGACCATG GTAAAACCAC 
GCGTTTGCTC CGAAGTATTC GGTTCCGCAA 
GACAGCGCAC CAGAAGAAAA AGCTCGTGGT 
CGTTGAATAC AACTCGCTGA, TCCGTCACTA 
GTCACGCTGA CTATGTGAAG AAC 



difficile 



ID NO: 2258 

CCGCACGTCA ACGTTGGCAC 50 

TCTGACTGCT GCTCTGACTC 100 

TCGTTGATTT CGATAAAATC 150 

ATCACCATCA ACACCGCGCA 200 

CGCTCACGTT GACTGCCCAG 250 

273 



2) INFORMATION FOR SEQ ID NO: 2259 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2259 

GTGGCAAAGG CTAAGTTCGA GCGTACCAAG CCGCACGTCA ACATCGGCAC 50 

CATCGGTCAC GTTGACCACG GTAAGACCAC CACCACCGCT GCTATCACCA 100 

AGGTTTTGGC AGACGCTTAC CCAGAGCTGA ACGAAGCTTT CGCTTTCGAT 150 

GCCATCGATA AGGCACCGGA AGAGAAAGAG CGTGGTATTA CCATCAACAT 200 

CTCCCACGTG GAGTACCAGA CCGAGAAGCG CCACTACGCA CACGTTGACG 250 

CTCCAGGTCA CGCTGACTAC ATCAAGAAC 279 



2) INFORMATION FOR SEQ ID NO: 2260 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Enterobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2260 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCATG GTAAAACTAC CCTGACTGCT GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGTGGTTCTG CTCGTGCATT CGACCAGATC 150 

GATAACGCAC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACCTCTCA 200 

CGTTGAATAT GACACCCCGA CTCGCCACTA CGCACACGTA GACTGCCCAG 250 

GTCACGCCGA CTATGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2261 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2261 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCATG GTAAAACTAC CCTGACTGCT GCCATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGTGGTTCCG CTCGCGCATT CGACCAGATC 150 

GATAACGCGC CGGAAGAAAA AGCTCGTGGT ATCACCATCA ACACCTCTCA 200 

CGTTGAATAT GACACCCCGA CTCGCCACTA CGCGCACGTA GACTGCCCGG 250 

GCCACGCCGA CTATGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2262 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2262 

ATGGCAAAAG AAAAATTTGA CCGCTCTAAA CCCCATGTTA ACATTGGTAC 50 
TATTGGACAC GTTGACCATG GTAAAACAAC TTTAACTGCT GCAATTACAA 100 
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CTGTACTTGC TAAAAAAGGC TATGCTGATG CACAAGCTTA TGACCAAATT 
GATGGTGCTC CAGAAGAAAG AGAACGTGGA ATCACAATCT CTACTGCTCA 
CGTTGAGTAC CAAACTGACA GCCGTCACTA TGCACACGTT GACTGCCCAG 
GACATGCCGA TTACGTTAAA AAC 



150 
200 
250 
273 



2) INFORMATION FOR SEQ ID NO: 2263 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 

(B) STRAIN: ATCC 25291 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2263 

GTGGCGAAGG CGAAGTTCGA GCGGACGAAG CCGCACGTCA ACATCGGGAC 50 

CATCGGTCAC GTTGACCACG GCAAGACCAC GCTGACCGCG GCTATCACCA 100 

AGGTTCTGCA CGACAAGTAC CCGGACCTGA ACGAGTCCCG CGCGTTCGAC 150 

CAGATCGACA ACGCGCCCGA GGAGCGTCAG CGCGGTATCA CCATCAACAT 2 00 

CTCCCACGTG GAGTACCAGA CCGACAAGCG GCACTACGCT CACGTCGACG 250 

CCCCGGGTCA CGCCGACTAC ATCAAGAAC 2 79 

2) INFORMATION FOR SEQ ID NO: 2264 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium go r dona e 

(B) STRAIN: Mgor-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2264 

GTGGCGAAGG CGAAGTTCCA GCGGACCAAG CCGCACGTCA ACATCGGGAC 50 

CATCGGTCAC GTTGACCACG GCAAGACCAC TCTGACCGCG GCTATCACCA 100 

AGGTCCTGCA CGACAAGTAC CCGGACCTGA ACGAGTCCAA GGCGTTCGAC 150 

CAGATCGACA ACGCGCCTGA GGAGCGTCAG CGCGGTATCA CGATCAACAT 200 

CGCGCACGTG GAATACCAGA CCGAGAAGCG TCACTACGCG CACGTCGACG 250 

CCCCCGGCCA CGCCGACTAC ATCAAGAAC 279 



2) INFORMATION FOR SEQ ID NO: 2265 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium kansasii 

(B) STRAIN: Mkan-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2265 

GTGGCGAAGG CGAAGTTCCA GCGGACCAAG CCCCACGTCA ACATCGGGAC 50 

CATCGGTCAC GTTGACCACG GCAAGACCAC CCTGACCGCG GCTATCACCA 100 

AGGTCCTGCA TGACAAGTTC CCGGACCTGA ACGAGTCGAA GGCGTTCGAC 150 

CAGATCGACA ACGCTCCTGA GGAGCGCCAG CGCGGTATCA CGATCAACAT 200 

CGCGCACGTG GAGTACCAGA CCGAGAAGCG GCACTATGCA CACGTCGACG 250 

CGCCGGGCCA CGCCGACTAC ATCAAGAAC 279 



2) INFORMATION FOR SEQ ID NO: 2266 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium terra e 

(B) STRAIN: Mter-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2266 

GTGGCGAAGG CGAAGTTCGA GCGGACGAAG CCGCACGTCA ACATCGGGAC 50 

CATCGGTCAC GTTGACCACG GCAAGACCAC GCTGACCGCG GCTATCACCA 100 

AGGTTCTGCA CGACAAGTAC CCGGACCTCA ACGAGTCGCG TGCGTTCGAC 150 

CAGATCGACA ACGCTCCCGA AGAGCGTCAG CGCGGTATCA CCATCAACAT 200 

CTCCCACGTG GAGTACCAGA CCGAGAAGCG GCACTACGCC CACGTCGACG 250 

CTCCTGGTCA CGCTGACTAC ATCAAGAAC 279 



2) INFORMATION FOR SEQ ID NO: 2267 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2267 

ATGGCTAAGG AAAAATTCGA ACGTAGCAAA CCGCACGTAA ACGTTGGCAC 50 

CATCGGTCAC GTTGACCATG GTAAAACCAC TCTGACTGCT GCTTTGACTA 100 

CTATTTTGGC TAAAAAATTC GGCGGTGCTG CAAAAGCTTA CGACCAAATC 150 

GACAACGCAC CCGAAGAAAA AGCACGCGGT ATTACCATTA ACACCTCGCA 2 00 

CGTAGAATAC GAAACCGAAA CCCGCCACTA CGCACACGTA GACTGCCCGG 250 

GTCACGCCGA CTACGTTAAA AAC 2 73 



2) INFORMATION FOR SEQ ID NO: 2268 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2268 

ATGGCAAAAG AAAAATTTGA TCGCTCAAAA GAACATGCCA ATATTGGTAC 50 

TATCGGTCAC GTTGACCATG GTAAAACAAC TTTAACAGCT GCTATCGCAA 100 

CTGTATTAGC TAAAAATGGT GACACTGTTG CACAATCATA CGATATGATT 150 

GACAACGCTC CAGAAGAAAA AGAACGTGGT ATTACAATCA ATACTGCACA 20 0 

TATCGAATAC CAAACTGACA AACGTCACTA TGCTCACGTT GACTGCCCAG 250 

GACACGCTGA CTATGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2269 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2269 
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ATGGCAAAAG AAAAATTTGA TCGCTCAAAA GAACATGCCA ATATTGGTAC 
TATCGGTCAC GTTGACCATG GTAAAACTAC TTTAACAGCT GCTATCGCAA 
CTGTATTAGC TAAAAATGGT GACACTGTAG CACAATCATA TGACATGATT 
GACAACGCTC CAGAAGAAAA AGAACGTGGT ATCACAATCA ATACTGCACA 
CATCGAGTAT CAAACTGACA AACGTCACTA TGCTCACGTT GACTGCCCAG 
GACACGCTGA CTATGTTAAA AAC 



50 
100 
150 
200 
250 
273 



2) INFORMATION FOR SEQ ID NO: 2270 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double ' 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aeromonas hydrophila 

(B) STRAIN: ATCC 7966 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2270 

CGATCCTGGT AGTAGCAGCG ACTGACGGCC CGATGCCGCA GACTCGTGAG 50 

CACATCCTGC TGGGTCGTCA GGTAGGCGTT CCGTACATCA TCGTGTTCAT 100 

GAACAAGTGT GACATGGTAG ATGACGAAGA GCTGCTGGAA CTGGTCGAGA 150 

TGGAAGTTCG CGAACTGCTG TCCGAGTACG ACTTCCCGGG TGATGACCTG 200 

CCGGTAGTCC GTGGTTCYGC ACTGAAAGCG CTGGAAGGCG AAGCTCAGTG 250 

GGAAGAGAAG ATCCTGGAAC TGGCTGGCCA CCTGGACACC TACATTCCGG 300 

AGCCGGAGCG TGCCATCGAC CTGCCGTTCC TGATGCCTAT CGAAGACGTA 350 

TTCTCCATCG CTGGCCGYGG TACCGTAGTG ACCGGTCGTG TAGAGCGCGG 4 00 

TATCGTCAAA GTTGGTGAAG AAGTGGAAAT CGTKGGTATC AAAGATACCA 4 50 

CCAAGACCAC CTGTACCGGC GTTGAAATGT TCCGCAAACT GCTGGACGAA 500 

GGTCGTGCAG GCGAGAACAT CGGTGCACTG CTGCGTGGCG TGAAGCGTGA 550 

AGACGTAGAG CGTGGTCAGG TACTGGCCAA GCCGGGCACC ATCAAGCCGC 6 00 

ACACCAAGTT YGAATCTGAA GTGTACGTGC TGTCCAAAGA AGAAGGTGGT 650 

CGTCATACCC CGTTCTTCAA AGGCTACCGT CCGCAGTTCT ACTTCCGTAC 7 00 

TACCGACGTG ACCGGTACCA TCGAACTGCC GGAAGGCGTA GAGATGGTAA 750 

TGCCGGGCGA CAACATCAAG ATGGTTGTTA CCCTGATTGC GCCGATCGCG 800 

ATGGACGACG GC 812 



2) INFORMATION FOR SEQ ID NO: 2271 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

'(A) ORGANISM: Bilophila wadsworthia 
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(B) STRAIN: ATCC 49260 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2271 



CGACGGTCCC 
TCGGCGTGCC 
GACCCCGAAC 
CTCCTACGGC 
TGAAGGCTCT 
CTCGAACTGC 
AACCGACAAG 
GCCGCGGTAC 
GGCGAAGAAG 
CACCGGCGTC 
ACAACATCGG 
GGCCAGGTTC 
GGCTGAAGTG 
TCTTCACCGG 
GGTATCATCG 
CGCTACCTTT 



ATGCCCCAGA 
TCACCTCGTC 
TGCTCGAACT 
TACCCCGGCG 
GGAATCCGAT 
TCGCCGCTTG 
CCCTTCCTGA 
CGTGGTCACC 
TCGAAATCGT 
GAAATGTTCC 
CGCTCTGCTC 
TCGCCGCTCC 
TACGTTCTGT. 
CTATCGTCCT 
CTCTTGAAGA 
AATGTCGAGC 



CCCGTGAGCA CATCCTGCTC GCCCGTCAGG 
GTGTTCATGA ACAAGTGTGA CCTCGTCGAC 
CGTCGAAATG GAAGTCCGCG AACTGCTGAG 
ATGAAATCCC GGTTGTCCGC GGTTCCGCTC 
AGCGCTGATT CCCCTGACGC CCAGTGCGTG 
CGACAGCTAC TTCCCGGATC CGGTCCGCGA 
TGCCCATCGA AGACGTGTTC TCCATCTCCG 
GGTCGTGTGG AACGTGGCAT CATCAAGGTC 
GGGTATCCGT CCCACCGTGA AGACGACCTG 
GCAAGCTGCT CGATCAGGGC CAGGCCGGCG 
CGCGGCACGA AGCGTGACGA AGTGGAACGC 
CAAGAGCATC ACGCCCCACA AGAAGTTCAA 
CCAAGGAAGA AGGCGGCCGC CATACCCCGT 
CAGTTCTACT TCCGTACCAC CGACATCACC 
AGGCGTTGAA ATGGTTATGC CCGGCGATAA 
TCATTCACCC CATCGCCATG GAAAAGGGC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
799 



2) INFORMATION FOR SEQ ID NO: 2272 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 786 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brevundimonas diminuta 

(B) STRAIN: ATCC 11568 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2272 

GATCCTGGTG TGCTCGGCCG CTGACGGCCC GATGCCGCAG ACCCGCGAGC 50 

ACATCCTGCT GTCGCGTCAG GTCGGCGTTC CGGCGCTGGT GGTGTTCCTG 100 

AACAAGGTCG ACATGGTCGA CGACGAGGAR CTGCTGGAGC TGGTCGAGAT 150 

GGAAGTGCGC GAGCTGCTGA GCTCGTACCA GTTCCCGGGC GACGACATCC 2 00 

CGGTGGTCAA GGGCTCGGCC CTGGCCGCGG TGGAGGATCG CGACCCGCAC 250 

ATCGGCGCCG AGCGCGTTCT GGAGCTGATG GCGGCGGTCG ACAGCTACAT 3 00 

CCCGCAGCCG GAACGTCCGA TCGACATGCC GTTCCTGATG CCGGTGGAAG 3 50 

ACGTGTTCTC GATCTCGGGC CGCGGCACCG TGGTGACGGG TCGCGTCGAG 4 00 

CGCGGCGTCG TCAAGGTCGG TGAAGAAGTC GAAATCGTCG GCATCCGTCC 4 50 

GGTTCAGAAG ACGACCTGCA CGGGCGTCGA AATGTTCCGC AAGCTGCTGG 500 

AYCAGGGTCA RGCCGGCGAC AACGTGGGCG TGCTGCTGCG CGGCACCAAG 550 

CGTGAAGACG TCGAGCGCGG CCAGGTGCTG TGCAAGCCGG GTTCGATCAC 600 

CCCGCACACC AAGTTCGTGG CTGAAGCCTA CATCCTGAAC AAGGAAGAAG 650 

GCGGCCGTCA CACGCCGTTC TTCACGAACT ACCGTCCGCA GTTCTACTTC 700 

CGCACGACGG ACGTGACCGG CATCGTGCGC CTGAAGGAAG GCGTCGAGAT 750 

GATCATGCCG GGCGACAACG CCGAGCTGGA CGTCGA 786 
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2) INFORMATION FOR SEQ ID NO: 2273 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 bases 

(B) TYPE: Nucleic acid 

(C) STRAN DEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2273 



GCTATTATGG 
GTACCTGCCG 
ATTCCTCAAT 
CGATTTCAAG 
GAAAAAATGG 
CAAACCGCAA 
TCTTTTCAAA 
ATGACAGACA 
TGCAATCCCA 
CAGATGATGA 
GTCACACCAG 
TGTTTGGACA 



CTGGATTACT 
TCAAATCGCT 
GGCCTTGGTA 
TAATACAAGC 
CTGCCGCTTA 
TACGTCAACC 
TGGCGGATCA 
TGATGAAGAC 
GGAGTTTATC 
GCTAGAGAAG 
ACGAACTATT 



ATCCAATACG 
GGAAGCAGTC 
TCAATTATCC 
GAATCTGGTA 
CGCTGCCTTT 
GAGTTGTCTT 
AAAGCCATGA 
CGTTCTTCAG 
AAGCAGGTAA 
TTGACAAAAC 
TGTTGGCTAT 



CTATCCAAGA 
GGATTAGATA 
TGAGATGCAT 
ACCAATACGG 
GCTAATGGCG 
TAGCGACGGT 
AAGAGACGAC 
TCTGGAACTG 
AACTGGTACT 
CTTATTACAG 
ACTCCACAAT 



ATCCCGTAAC 
ATTCATTGAA 
TATTCTAATG 
AGCAAGTAGC 
GTACATATTA 
ACAGAAAAAG 
AGCCTACATG 
GTACCAATGC 
TCCAACTATG 
TTCTAGCATT 
ACTCTATGGC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
560 



2) INFORMATION FOR SEQ ID NO: 2274 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2274 

GCTACTTTGG AAACATTACT GTCCAATATG CTCTTCAACA ATCACGTAAT 50 

GTCACAGCCG TTGAAACTTT GAATAAGGTC GGTCTAGATA AGGCTAAAGC 100 

CTTCCTTAAT GGGCTTGGTA TTGATTATCC AAGCATGCAT TATGCAAACG 150 

CCATTTCAAG TAATACAACT GAATCCAACA AAAAATACGG TGCAAGTAGT 200 

GAAAAAATGG CTGCTGCCTA CGCTGCTTTT GCTAATGGTG GTATTTACCA 2 50 

CAAGCCAATG TACATCAATA AAATCGTCTT TAGCGACGGT AGTGAGAAAG 300 

AATTTTCTGA TGCCGGCACA CGAGCTATGA AAGAAACTAC TGCCTATATG 350 

ATGACTGAAA TGATGAAAAC AGTCCTAGTA TACGGTACCG GACGTGGAGC 4 00 

CTACCTACCA TGGCTTCCAC AAGCAGGTAA GACAGGTACT TCTAACTATA 4 50 
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CTGACGACGA AATTGAAAAG TATATCAAGA ACACTGGCTA CGTAGCCCCA 500 
GATGAAATGT TTGTAGGGTA TACTCGTAAA TATGCAATGG CTGTTTGGAC 550 
A 551 



2) INFORMATION FOR SEQ ID NO: 2275 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear. 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2275 



GCTATTATGG CTGGATTACT ATCCAATACG 
GTACCAGCCG TCAAATCGCT GGAAGCAGTC 
GTTCCTCAAT GGCCTTGGTA TTAATTACCC 
CGATTTCAAG TAATACAAGC GAATCTGGTA 
GAAAAAATGG CTGCCGCTTA CGCTGCCTTT 
CAAACCGCAA TACGTCAACC GAGTTGTCTT 
TCTTTTCAAA TGGCGGATCA AAAGCCATGA 
ATGACAGACA TGATGAAGAC CGTTCTTCAA 
TGCAATTCCA GGAGTCTATC AAGCAGGTAA 
CAGATGATGA ACTAGAGAAG TTGACAAAAC 
GTCACACCAG ACGAGCTGTT TGTTGGCTAC 
TGTTTGGACA 



CTATCCAAGA ATCCCGTAAC 50 

GGATTAGATA ATTCATTGAA 100 

TGAAATGCAT TATTCTAATG 150 

ACCAATACGG AGCAAGTAGC 20 0 

GCTAATGGCG GTACATATTA 250 

TAGCGACGGT ACAGAAAAAG 300 

AAGAAACGAC AGCCTACATG 350 

TCTGGAACTG GTACCAATGC 4 00 

AACCGGCACT TCCAACTATG 4 50 

CTTATTACAG TTCTAGCATT 500 

ACTCCACAGT ACTCTATGGC 550 

560 



2) INFORMATION FOR SEQ ID NO: 2276 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) ♦ TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2276 

TTATTTTGGA AATATCACCA TCCAATATGC GCTCCAACAA TCACGGAACG 50 

TTACAGCCGT AGAAACCTTG AACAAAGTCG GTTTGGATAG AGCCAAGACC 100 

TTCCTGAATG GAATCGGTAT TGACTATCCA GATATGCACT ATGCCAACGC 150 

GATTTCAAGT AATACGACTG AGTCAAACAA AAAGTACGGA GCAAGTAGTG 200 
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AGAAAATGGC TGCTGCTTAC GCTGCTTTTG 
AAACCAATGT ATATCAACAA AATCGTCTTT 
ATACGCTGAT CCTGGTACTC GTGCCATGAA 
TGACAGAAAT GATGAAGACT GTCTTGGCAT 
TATCTCCCTT GGCTACCTCA AGCTGGTAAG 
AGATGATGAA ATTGAAAACT ACATCAAAAA 
ACGAAATGTT TGTTGGTTAT ACTCGCAAAT 



CTAACGGTGG TATCTACCAT 250 

AGCGATGGTA GCTCAAAAGA 300 

AGAGACGACC GCCTATATGA 350 

ACGGAACGGG TCGTGGTGCT 4 00 

ACTGGTACAT CAAACTATAC 450 

TACTGGTTAT GTAGCCCCAG 500 

ATTCAATGGC TGTWTGGACA 550 



2) INFORMATION FOR SEQ ID NO: 2277 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 35401 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2277 



GCTCCTATCT GGATTATGCG 
GATGTCCGAG ATGGCCTGAA 
GAACGTACTA GGCAATGACT 
TCGTTGGTGA CGTAATCGGT 
TATGACACGA TCGTCCGCAT 
GGTAGACGGT CAGGGTAACT 
CAATGCGTTA TACGGAAATC 
GCCGAT 



ATGTCGGTCA TTGTTGGCCG 
GCCGGTACAC CGTCGCGTAC 
GGAACAAAGC CTATAAAAAA 
AAATACCATC CCCATGGTGA 
GGCGCAGCCA TTCTCGCTGC 
TCGGTTCTAT CGACGGCGAC 
CGTCTGGCGA AAATTGCCCA 



TGCGCTGCCA 50 

TTTACGCCAT 100 

TCTGCCCGTG 150 

CTCGGCGGTC 200 

GTTATATGCT 250 

TCTGCGGCGG 300 

TGAACTGATG 350 
356 



2) INFORMATION FOR SEQ ID NO: 2278 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 347 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 23511 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2278 

CTCTTATCTG GATTATGCGA TGTCGGTCAT TGTTGGCCGC GCGCTGCCGG 50 

ATGTCCGAGA TGGCCTGAAG CCGGTACACC GTCGCGTACT TTACGCCATG 100 

AACGTATTGG GCAATGACTG GAACAAAGCC TACAAAAAAT CAGCCCGTGT 150 

CGTTGGTGAC GTGATCGGTA AATACCACCC GCACGGCGAC TCCGCGGTAT 200 
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ATGACACCAT CGTTCGTATG GCGCAGCCGT TCTCGCTGCG CTACATGCTG 250 
GTGGATGGCC AGGGGAACTT CGGTTCAATC GACGGCGACT CCGCCGCGGC 3 00 

AATGCGTTAT ACGGAAATCC GTCTGGCGAA AATTGCTCAC GAACTGA 347 



2) INFORMATION FOR SEQ ID NO: 2279 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear, 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia 

(B) STRAIN: ATCC 43886 

(xi) SEQUENCE DESCRIPTION: SEQ 

AGAGCTCCTA TCTGGATTAT GCGATGTCGG 
CCAGATGTCC GAGATGGCCT GAAGCCGGTA 
CATGAACGTA CTAGGCAATG ACTGGAACAA 
GTGTCGTTGG TGACGTAATC GGTAAATACC 
GTCTATGACA CGATCGTCCG CATGGCGCAG 
GCTGGTAGAC GGTCAGGGTA ACTTCGGTTC 
CGGCAATGCG TTATACGGAA ATCCGTCTGG 
ATGGCCGATC TC 



coli 



ID NO: 2279 

TCATTGTTGG CCGTGCGCTG 50 

CACCGTCGCG TACTTTACGC 100 

AGCCTATAAA AAATCTGCCC 150 

ATCCCCATGG TGACTCGGCG 2 00 

CCATTCTCGC TGCGTTATAT 250 

TATCGACGGC GACTCTGCGG 3 00 

CGAAAATTGC CCATGAACTG 3 50 

362 



2) INFORMATION FOR SEQ ID NO: 2280 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2280 

CTCCTATCTG GATTATGCGA TGTCGGTCAT TGTTGGCCGT GCGCTGCCAG 50 

ATGTCCGAGA TGGCCTGAAG CCGGTACACC GTCGCGTACT TTACGCCATG 100 

AACGTACTAG GCAATGACTG GAACAAAGCC TATAAAAAAT CTGCCCGTGT 150 

CGTTGGTGAC GTAATCGGTA AATACCATCC CCATGGTGAC TCGGCGGTTT 200 

ATGACACGAT CGTCCGTATG GCGCAGCCAT TCTCGCTGCG TTACATGCTG 250 

GTAGACGGTC AGGGTAACTT CGGTTCCATC GACGGCGACT CTGCGGCGGC 300 

AATGCGTTAT ACGGAAATCC GTCTGGCGAA AATTGCCCAT GAACTGATGG 3 50 

CCGATCTC 358 
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2) INFORMATION FOR SEQ ID NO: 2281 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 2281 
CCCCCAGCTG GGCGGCGGTA TCGATGGGGG 



2) INFORMATION FOR SEQ ID NO: 2282 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2282 
AGRRGCIMAR ATGTATGA 
2) INFORMATION FOR SEQ ID NO: 2283 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2283 
ATITATGAYG GKITTCAGAG GC 



2) INFORMATION FOR SEQ ID NO: 2284 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2284 
TCTGWGTRAC IGGYTCKGAG A 



2) INFORMATION FOR SEQ ID NO: 2285 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single, 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2285 
CMCCICCWGG TGGWGAWAC 



2) INFORMATION FOR SEQ ID NO: 2286 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2286 
AGTTGCTGTA TTAGGAAATG 



2) INFORMATION FOR SEQ ID NO: 2287 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2287 
TCGAAGTTGC TGTATTAGGA 



2) INFORMATION FOR SEQ ID NO: 2288 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1240 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: BM4339 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2288 

TAGAAGCTGG CTCGTTTTTT TATAAATAAG TTATTCGTTT ATTTTTGTTT 50 

TGTGCTAAAA TATGAGAGTA AATCACTGAA CGATTTAGAA TACAGGAGGA 100 

CAATCTTTTG AAGATTACTT TACTATATGG CGGACGCAGC GCAGAGCAGA 150 

GCATGAAGTG TCCATTCTTT CCGCATTTTC AGTTTTAAAT GCCATTTATT 200 

ATAATTATTA CCAAGTTCAA CTCGTATTTA TTACAAAAGA AGGACAATGG 250 

GTCAAAGGTC CATTACTAAC AGAAAAACCT GCCAGCAAAG ATGTCTTGCA 3 00 

TCTTTCATGG GACCCAAGTG GACAGACAGA GGAAGGCTTT ACAGGAAAAG 350 

TGATCAATCC GGGCGAAATC AAAGAAGAAG GAGCCATCGT TTTTCCAGTT 4 00 

TTACATGGGC CAAACGGGGA AGATGGAACG ATCCAAGGCT TCTTAGAGAC 450 

ATTGAATATG CCTTATGTCG GCGCAGGCGT ATTGACCAGT GCATGTGCCA 500 

TGGATAAAAT CATGACCAAG TATATTTTAC AAGCTGCTGG TGTGCCGCAA 550 

GTTCCTTATG TACCAGTACT TAAGAATCAA TGGAAAGAAA ATCCTAAAAA 6 00 

AGTATTTGAT CAATGTGAAG GTTCTTTGCT TTATCCGATG TTTGTCAAAC 6 50 

CGGCGAATAT GGGTTCTAGT GTCGGCATTA CAAAAGCAGA AAACCGAGAA 700 

GAGCTGCAAA ATGCTTTAGC AACAGCCTAT CAGTATGATT CTCGAGCAAT 750 

CGTTGAACAA GGAATTGAAG CGCGCGAAAT CGAAGTTGCT GTATTAGGAA 800 

ATGAAGACGT TCGGACGACT TTGCCTGGTG AAGTCGTAAA AGACGTAGCA 850 

TTCTATGATT ATGAAGCAAA ATATATCAAT AATAAAATCG AAATGCAGAT 900 

TCCAGCCGAA GTGCCAGAAG AAGTTTATCA AAAAGCGCAA GAGTACGCGA 950 

AGTTAGCTTA CACGATGTTA GGTGGAAGCG GATTGAGCCG GTGCGATTTC 1000 

TTTTTGACAA ATAAAAATGA ATTATTCCTG AATGAATTAA ACTCTATGCC 1050 

AGGATTTACG GAGTTCAGTA TGTACCCACT CTTATGGGAA AATATGGGCT 1100 

TGAAATACGG TGATTTGATT GAAGAACTGA TCCAGTTAGG AATGAATCGA 1150 

TACCATCAGC GTCAATCTTT TTTTGAAAAA AATGAATAAA GAGAAATAAA 12 00 

GAAGAGGCTG GAGTGATTGC GTAACCGCGT TCATTCTAGC 1240 



2) INFORMATION FOR SEQ ID NO: 2289 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2289 
CACCGAAGAA GATGAAAAAA 
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2) INFORMATION FOR SEQ ID NO: 2290 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ , ID NO: 2290 
TGGCACCGAA GAAGATGA 18 



2) INFORMATION FOR SEQ ID NO: 2291 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2291 
ATTTTGGCAC CGAAGAAGA 19 
2) INFORMATION FOR SEQ ID NO: 2292 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2292 
GAATCGGCAA GACAATATG 19 



2) INFORMATION FOR SEQ ID NO: 2293 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecivm 

(B) STRAIN: BM4147 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2293 

ATGAATAGAA TAAAAGTTGC AATACTGTTT GGGGGTTGCT CAGAGGAGCA 50 

TGACGTATCG GTAAAATCTG CAATAGAGAT AGCCGCTAAC ATTAATAAAG 100 

AAAAATACGA GCCGTTATAC ATTGGAATTA CGAAATCTGG TGTATGGAAA 150 

ATGTGCGAAA AACCTTGCGC GGAATGGGAA AACGACAATT GCTATTCAGC 200 

TGTACTCTCG CCGGATAAAA AAATGCACGG ATTACTTGTT AAAAAGAACC 2 50 

ATGAATATGA AATCAACCAT GTTGATGTAG CATTTTCAGC TTTGCATGGC 300 

AAGTCAGGTG AAGATGGATC CATACAAGGT CTGTTTGAAT TGTCCGGTAT 350 

CCCTTTTGTA GGCTGCGATA TTCAAAGCTC AGCAATTTGT ATGGACAAAT 4 00 

CGTTGACATA CATCGTTGCG AAAAATGCTG GGATAGCTAC TCCCGCCTTT 4 50 

TGGGTTATTA ATAAAGATGA TAGGCCGGTG GCAGCTACGT TTACCTATCC 500 

TGTTTTTGTT AAGCCGGCGC GTTCAGGCTC ATCCTTCGGT GTGAAAAAAG 550 

TCAATAGCGC GGACGAATTG GACTACGCAA TTGAATCGGC AAGACAATAT 600 

GACAGCAAAA TCTTAATTGA GCAGGCTGTT TCGGGCTGTG AGGTCGGTTG 6 50 

TGCGGTATTG GGAAACAGTG CCGCGTTAGT TGTTGGCGAG GTGGACCAAA 700 

TCAGGCTGCA GTACGGAATC TTTCGTATTC ATCAGGAAGT CGAGCCGGAA 750 

AAAGGCTCTG AAAACGCAGT TATAACCGTT CCCGCAGACC TTTCAGCAGA 800 

GGAGCGAGGA CGGATACAGG AAACGGCAAA AAAAATATAT AAAGCGCTCG 850 

GCTGTAGAGG TCTAGCCCGT GTGGATATGT TTTTACAAGA TAACGGCCGC 900 

ATTGTACTGA ACGAAGTCAA TACTCTGCCC GGTTTCACGT CATACAGTCG 950 

TTATCCCCGT ATGATGGCCG CTGCAGGTAT TGCACTTCCC GAACTGATTG 1000 

ACCGCTTGAT CGTATTAGCG TTAAAGGGGT GA 1032 



2) INFORMATION FOR SEQ ID NO: 2294 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2294 
AAACGAGGAT GATTTGATTG 



2) INFORMATION FOR SEQ ID NO: 2295 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2295 
TTGAGCAAGC GATTTCGG 



2) INFORMATION FOR SEQ ID NO: 2296 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1029 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: V583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2296 

ATGAATAAAA TAAAAGTCGC AATTATCTTC GGCGGTTGCT CGGAGGAACA 50 

TGATGTGTCG GTAAAATCCG CAATAGAAAT TGCTGCGAAC ATTAATACTG 100 

AAAAATTCGA TCCGCACTAC ATCGGAATTA CAAAAAACGG CGTATGGAAG 150 

CTATGCAAGA AGCCATGTAC GGAATGGGAA GCCGATAGTC TCCCCGCCAT 200 

ATTCTCCCCG GATAGGAAAA CGCATGGTCT GCTTGTCATG AAAGAAAGAG 250 

AATACGAAAC TCGGCGTATT GACGTGGCTT TCCCGGTTTT GCATGGCAAA 300 

TGCGGGGAGG ATGGTGCGAT ACAGGGTCTG TTTGAATTGT CTGGTATCCC 350 

CTATGTAGGC TGCGATATTC AAAGCTCCGC AGCTTGCATG GACAAATCAC 400 

TGGCCTACAT TCTTACAAAA AATGCGGGCA TCGCCGTCCC CGAATTTCAA 450 

ATGATTGAAA AAGGTGACAA ACCGGAGGCG AGGACGCTTA CCTACCCTGT 500 

CTTTGTGAAG CCGGCACGGT CAGGTTCGTC CTTTGGCGTA ACCAAAGTAA 550 

ACAGTACGGA AGAACTAAAC GCTGCGATAG AAGCAGCAGG ACAATATGAT 600 

GGAAAAATCT TAATTGAGCA AGCGATTTCG GGCTGTGAGG TCGGCTGCGC 650 

GGTCATGGGA AACGAGGATG ATTTGATTGT CGGCGAAGTG GATCAAATCC 700 

GGTTGAGCCA CGGTATCTTC CGCATCCATC AGGAAAACGA GCCGGAAAAA 750 

GGCTCAGAGA ATGCGATGAT TATCGTTCCA GCAGACATTC CGGTCGAGGA 800 

ACGAAATCGG GTGCAAGAAA CGGCAAAGAA AGTATATCGG GTGCTTGGAT 850 

GCAGAGGGCT TGCTCGTGTT GATCTTTTTT TGCAGGAGGA TGGCGGCATC 900 

GTTCTAAACG AGGTCAATAC CCTGCCCGGT TTTACATCGT ACAGCCGCTA 950 

TCCACGCATG GCGGCTGCCG CAGGAATCAC GCTTCCCGCA CTAATTGACA 1000 

GCCTGATTAC ATTGGCGATA GAGAGGTGA 102 9 



2) INFORMATION FOR SEQ ID NO: 2297 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2297 
TTCAGGAGGG GGATCGC 
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